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A First Course in String Theory 

Second Edition 



Barton Zwiebach is once again faithful to his goal of making string theory accessible to 
undergraduates. He presents the main concepts of string theory in a concrete and physical 
way to develop intuition before formalism, often through simplified and illustrative exam- 
ples. Complete and thorough in its coverage, this new edition now includes the AdS/CFT 
correspondence and introduces superstrings. It is perfectly suited to introductory courses 
in string theory for students with a background in mathematics and physics. 

This new edition contains completely new chapters on the AdS/CFT correspondence, 
an introduction to superstrings, and new sections covering strings on orbifolds, cos- 
mic strings, moduli stabilization, and the string theory landscape. There are almost 300 
problems and exercises, with password protected solutions available to instructors at 
www.cambridge.org/zwiebach. 

Barton Zwiebach is Professor of Physics at the Massachusetts Institute of Technology. 
His central contributions have been in the area of string field theory, where he did the 
early work on the construction of the field theory of open strings and then developed the 
field theory of closed strings. He has also made important contributions to the subjects of 
D-branes with exceptional symmetry and tachyon condensation. 

From the first edition 

‘A refreshingly different approach to string theory that requires remarkably little previous 
knowledge of quantum theory or relativity. This highlights fundamental features of the the- 
ory that make it so radically different from theories based on point-like particles. This book 
makes the subject amenable to undergraduates but it will also appeal greatly to beginning 
researchers who may be overwhelmed by the standard textbooks.’ 

Professor Michael Green, University of Cambridge 

‘Barton Zwiebach has written a careful and thorough introduction to string theory that is 
suitable for a full-year course at the advanced undergraduate level. There has been much 
demand for a book about string theory at this level, and this one should go a long way 
towards meeting that demand.’ 

Professor John Schwarz, California Institute of Technology 

‘There is a great curiosity about string theory, not only among physics undergraduates but 
also among professional scientists outside of the field. This audience needs a text that goes 
much further than the popular accounts but without the full technical detail of a graduate 



text. Zwiebach’s book meets this need in a clear and accessible manner. It is well-grounded 
in familiar physical concepts, and proceeds through some of the most timely and exciting 
aspects of the subject.’ 

Professor Joseph Polchinski, University of California, Santa Barbara 

‘Zwiebach, a respected researcher in the field and a much beloved teacher at MIT, is truly 
faithful to his goal of making string theory accessible to advanced undergraduates - the 
test develops intuition before formalism, usually through simplified and illustrative exam- 
ples . . . Zwiebach avoids the temptation of including topics that would weigh the book 
down and make many students rush it back to the shelf and quit the course.’ 

Marcelo Gleiser, Physics Today 

‘. . . well- written . . . takes us through the hottest topics in string theory research, requiring 
only a solid background in mechanics and some basic quantum mechanics . . . This is not 
just one more text in the ever-growing canon of popular books on string theory . . . ’ 

Andreas Karch, Times Higher Education Supplement 

. . the book provides an excellent basis for an introductory course on string theory and 
is well-suited for self-study by graduate students or any physicist who wants to learn the 
basics of string theory’. 

Zentralblatt MATH 

\ . .excellent introduction by Zwiebach . . . aimed at advanced undergraduates who have 
some background in quantum mechanics and special relativity, but have not necessarily 
mastered quantum field theory and general relativity yet . . . the book ... is a very thorough 
introduction to the subject . . . Equipped with this background, the reader can safely start 
to tackle the books by Green, Schwarz and Witten and by Polchinski.’ 

Marcel L. Vonk, Mathematical Reviews Clippings 



Cover illustration: a composite illustrating open string motion as we vary the strength of 
an electric field that points along the rotational axis of symmetry. There are three surfaces, 
each composed of two lobes joined at the origin and shown with the same color. Each 
surface is traced by a rotating open string that, at various times, appears as a line stretching 
from the boundary of a lobe down to the origin and then out to the boundary of the opposite 
lobe. The inner, middle, and elongated lobes arise as the magnitude of the electric field is 
increased. For further details, see Problem 19.2. 
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String theory is one of the most exciting fields in theoretical physics. This ambitious and 
speculative theory offers the potential of unifying gravity and all the other forces of nature 
and all forms of matter into one unified conceptual structure. 

String theory has the unfortunate reputation of being impossibly difficult to understand. 
To some extent this is because, even to its practitioners, the theory is so new and so ill 
understood. However, the basic concepts of string theory are quite simple and should be 
accessible to students of physics with only advanced undergraduate training. 

I have often been asked by students and by fellow physicists to recommend an intro- 
duction to the basics of string theory. Until now all I could do was point them either to 
popular science accounts or to advanced textbooks. But now I can recommend to them 
Barton Zwiebach’s excellent book. 

Zwiebach is an accomplished string theorist, who has made many important contribu- 
tions to the theory, especially to the development of string field theory. In this book he 
presents a remarkably comprehensive description of string theory that starts at the begin- 
ning, assumes only minimal knowledge of advanced physics, and proceeds to the current 
frontiers of physics. Already tested in the form of a very successful undergraduate course 
at MIT, Zwiebach’s exposition proves that string theory can be understood and appreciated 
by a wide audience. 

I strongly recommend this book to anyone who wants to learn the basics of string theory. 

David Gross 

Director, Kavli Institute For Theoretical Physics 
University of California, Santa Barbara 
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The idea of having a serious string theory course for undergraduates was first suggested to 
me by a group of MIT sophomores sometime in May of 2001. I was teaching Statistical 
Physics, and I had spent an hour-long recitation explaining how a relativistic string at high 
energies appears to approach a constant temperature (the Hagedorn temperature). I was 
intrigued by the idea of a basic string theory course, but it was not immediately clear to me 
that a useful one could be devised at this level. 

A few months later, I had a conversation with Marc Kastner, the Physics Department 
Head. In passing, I told him about the sophomores’ request for a string theory course. 
Kastner’s instantaneous and enthusiastic reaction made me consider seriously the idea for 
the first time. At the end of 2001, a new course was added to the undergraduate physics 
curriculum at MIT. In the spring term of 2002 I taught String Theory for Undergraduates 
for the first time. This book grew out of the lecture notes for that course. 

When we think about teaching string theory at the undergraduate level the main question 
is, “Can the material really be explained at this level?”. After teaching the subject two 
times, I am convinced that the answer to the question is a definite yes. Although a complete 
mastery of string theory requires a graduate-level physics education, the basics of string 
theory can be well understood with the limited tools acquired in the first two or three years 
of an undergraduate education. 

What is the value of learning string theory, for an undergraduate? By exposing the stu- 
dents to cutting-edge ideas, a course in string theory can help nurture the excitement and 
enthusiasm that led them to choose physics as a major. Moreover, students will find in string 
theory an opportunity to sharpen and refine their understanding of most of the undergradu- 
ate physics curriculum. This is valuable even for students who do not plan to specialize in 
theoretical physics. 

This book was tailored to be understandable to an advanced undergraduate. Therefore, 
I believe it will be a readable introduction to string theory for any graduate student or, in 
fact, for any physicist who wants to learn the basics of string theory. 
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Teaching string theory to a class composed largely of bright undergraduates was both a 
stimulating and a rewarding experience. I am grateful to the group of students that com- 
posed the first class: 



Jeffrey Brock 


Adam Granich 


Trisha Montalbo 


Zilong Chen 


Marketa Havlfckova 


Eugene Motoyama 


Blair Connely 


Kenneth Jensen 


Megha Padi 


Ivailo Dimov 
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Francis Lam 


James Pate 
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Gabrielle Magro 


James Smith 


Teresa Fazio 


Sourav Mandal 


Morgan Sonderegger 


Caglar Girit 
Donglai Gong 


Stefanos Marnerides 


David Starr 



They were enthusiastic, funny, and lively. My lectures were voice-recorded and three 
of the students, Gabrielle Magro, Megha Padi, and David Starr, turned the tapes and the 
blackboard equations into DTjgX files. I am grateful to the three of them for their dedication 
and for the care they took in creating accurate files. They provided the impetus to start the 
process of writing a book. I edited the files to produce lecture notes. 

Additional files for a set of summer lectures were created by Gabrielle and Megha. In 
the next six months the lecture notes became the draft for a book. After teaching the course 
for a second time in the spring term of 2003 and a long summer of edits and revisions, the 
book was completed in October 2003. 

By the time the lecture notes had become a book draft, David Starr offered to read it 
critically. He basically marked every paragraph, suggesting improvements in the exposition 
and demonstrating an uncanny ability to spot weak points. His criticism forced me to go 
through major rewriting. His input was tremendous. Whatever degree of clarity has been 
achieved, it is in no small measure thanks to his effort. 

I am delighted to acknowledge help and advice from my friend and colleague Jeffrey 
Goldstone. He shared generously his understanding of string theory, and several sections 
in this book literally grew out of his comments. He helped me teach the course the second 
time that it was offered. While doing so, he offered perceptive criticism of the whole text. 
He also helped improve many of the problems, for which he wrote elegant solutions. 

The input of my friend and collaborator Ashoke Sen was critical. He believed that string 
theory could be taught at a basic level and encouraged me to try to do it. I consulted 
repeatedly with him about the topics to be covered and about the strategies to present them. 
He kindly read the first full set of lecture notes and gave invaluable advice that helped shape 
the form of this book. 

The help and interest of many people made writing this book a very pleasant task. For 
detailed comments on all of its content 1 am indebted to Chien-Hao Liu and to James 
Stasheff. Alan Dunn and Blake Stacey helped test the problems that could not be assigned 
in class. Jan Troost was a sounding board and provided advice and criticism. I’ve relied 
on the knowledge of my string theory colleagues - Amihay Hanany, Daniel Freedman, 
and Washington Taylor. I’d like to thank Philip Argyres, Andreas Karch, and Frieder Lenz 
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for testing the lecture notes with their students. Juan Maldacena and Samir Mathur pro- 
vided helpful input on the subject of string thermodynamics and black holes. Boris Kors, 
Fernando Quevedo, and Angel Uranga helped and advised on the subject of string phe- 
nomenology. Thanks are also due to Tamsin van Essen, editor at Cambridge, for her advice 
and her careful work during the entire publishing process. 

Finally, I would like to thank my wife Gaby and my children Cecile, Evy, Margaret, and 
Aaron. At every step of the way I was showered with their love and support. Cecile and 
Evy read parts of the manuscript and advised on language. Questions on string theory from 
Gaby and Margaret tested my ability to explain. Young Aaron insisted that a ghost sitting 
on a string would make a perfect cover page, but we settled for strings moving in electric 
fields. 



Barton Zwiebach 

Cambridge, Massachusetts, 2003 
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It has been almost five years since I finished writing the first edition of A First Course 
in String Theory. I have since taught the undergraduate string theory course at MIT three 
times, and I have received comments and suggestions from colleagues all over the world. I 
have learned what parts of the book are most challenging for the students, and I have heard 
requests for extra material. 

As in the first edition, the book is broadly divided into Part I (Basics) and Part II (Devel- 
opments). In this second edition I have improved the clarity of many arguments and the 
general readability of Part I. This part is studied by the largest number of readers, many of 
them independently and outside of the classroom setting. The changes should make study 
easier. There are more figures and the number of problems has been increased to better 
cover the range of ideas developed in the text. Part I has five new sections and one new 
chapter. The new sections discuss the classical motion of closed strings, cosmic strings, 
and orbifolds. The new chapter, Chapter 14, is the last one of Part I. It explains the basics 
of superstring theory. 

Part II has changed as well. The ordering of chapters has been altered to bring T- 
duality earlier into the book. The material relevant to particle physics has been collected 
in Chapter 2 1 and includes a new section on moduli stabilization and the landscape. 
Chapter 23 is new and is entirely devoted to strong interactions and the AdS/CFT cor- 
respondence. I aim to give there a gentle introduction to this lively area of research. The 
number of chapters in the book has gone from twenty-three to twenty-six, a nice number 
to end a book on string theory ! 

I want to thank Hong Liu and Juan Maldacena for helpful input on the subject of 
AdS/CFT. Many thanks are also due to Alan Guth, who helped me teach the string the- 
ory course in the spring term of 2007. He tested many of the new problems and offered 
very valuable criticism of the text. 



About this book 



A First Course in String Theory should be accessible to anyone who has been exposed to 
special relativity, basic quantum mechanics, electromagnetism, and introductory statistical 
physics. Some familiarity with Lagrangian mechanics is useful but not indispensable. 

Except for the introduction, all chapters contain exercises and problems. The exercises, 
called Quick calculations, are inserted at various points throughout the text. They are con- 
trol calculations that are expected to be straightforward. Undue difficulty in carrying them 
out may indicate problems understanding the material. The problems at the end of the 
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chapters are more challenging and sometimes develop new ideas. A problem marked with 
a dagger' is one whose results are cited later in the text. A mastery of the material requires 
solving all the exercises and many of the problems. All the problems should be read, at 
least. 

Throughout most of the book the material is developed in a self-contained way, and very 
little must be taken on faith. Chapters 14, 21, 22, and 23 contain a few sections that address 
subjects of much interest for which a full explanation cannot be provided at the level of this 
book. The reader will be asked to accept some reasonable facts at face value, but otherwise 
the material is developed logically and should b e fully understandable. These sections are 
not addressed to experts. 

This book has two parts. Part 1 is called “Basics,” and Part II is called “Developments.” 
Part I begins with Chapter 1 and concludes with Chapter 14. Part II comprises the rest of 
the book: it begins with Chapter 15 and it ends with Chapter 26. 

Chapter 1 serves as an introduction. Chapter 2 reviews special relativity, but it also intro- 
duces concepts that are likely to be new: light-cone coordinates, light-cone energy, compact 
extra dimensions, and orbifolds. In Chapter 3 we review electrodynamics and its manifestly 
relativistic formulation. We make some comments on general relativity and study the effect 
of compact dimensions on the Planck length. We are able at this point to examine the excit- 
ing possibility that large extra dimensions may exist. Chapter 4 uses nonrelativistic strings 
to develop some intuition, to review the Lagrangian formulation of mechanics, and to intro- 
duce terminology. Chapter 5 uses the relativistic point particle to prepare the ground for 
the study of the relativistic string. The power and elegance of the Lagrangian formulation 
become evident at this point. The first encounter with string theory happens in Chapter 6, 
which deals with the classical dynamics of the relativistic string. This is a very important 
chapter, and it must be understood thoroughly. Chapter 7 solidifies the understanding of 
string dynamics through the detailed study of string motion, both for open and for closed 
strings. It includes a section on cosmic strings, a topic of potential experimental relevance. 
Chapters 1 through 7 could comprise a mini-course in string theory. 

Chapters 8 through 1 1 prepare the ground for the quantization of relativistic strings. In 
Chapter 8, one learns how to calculate conserved quantities, such as the momentum and 
the angular momentum of free strings. Chapter 9 gives the light-cone gauge solution of 
the string equations of motion and introduces the terminology that is used in the quantum 
theory. Chapter 10 explains the basics of quantum fields and particle states, with emphasis 
on the counting of the parameters that characterize scalar field states, photon states, and 
graviton states. In Chapter 1 1 we perform the light-cone gauge quantization of the rela- 
tivistic particle. It all comes together in Chapter 12, another important chapter that should 
be understood thoroughly. This chapter presents the light-cone gauge quantization of the 
open relativistic string. The critical dimension is obtained and photon states are shown to 
emerge. Chapter 12 contains a section on the subject of tachyon condensation. Chapter 13 
discusses the quantization of closed strings and the emergence of graviton states. It also 
contains two sections that deal with quantum closed strings on the simplest orbifold, the 
half-line. Chapter 14 is the last chapter of Part I. It introduces the subject of superstrings. 
The Ramond and Neveu-Schwarz sectors of open strings are presented and combined to 
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obtain a supersymmetric theory. The chapter concludes with a brief discussion of type II 
closed string theories. 

The first part of this book can be characterized as an uphill road that leads to the quantiza- 
tion of the string at the summit. In the second part of this book the climb is over. The pace 
slows down a little, and the material elaborates upon previously introduced ideas. In Part II 
one reaps many rewards for the effort exerted in Part I. 

The first chapter in Part II, Chapter 15, deals with the important subject of open strings 
on various D-brane configurations. The discussion of orientifolds has been relegated to the 
problems at the end of the chapter. Chapter 16 introduces the concept of string charge and 
demonstrates that the endpoints of open strings carry Maxwell charge. The next four chap- 
ters are organized around the fascinating subject of T-duality. Chapters 17 and 18 present 
the T-duality properties of closed and open strings, respectively. Chapter 19 studies D- 
branes with electromagnetic fields, using T-duality as the main tool. Chapter 20 introduces 
the general framework of nonlinear electrodynamics. It demonstrates that electromagnetic 
fields in string theory are governed by Born-Infeld theory, a nonlinear theory in which the 
self-energy of point charges is finite. 

String models of particle physics are considered in Chapter 21. This chapter explains 
in detail the particle content of the Standard Model and discusses one approach, based 
on intersecting D6-branes, to the construction of a realistic string model. The chapter 
concludes with some material on moduli stabilization and the landscape. 

Chapter 22 begins with string thermodynamics, followed by the subject of black hole 
entropy. It presents string theory attempts to derive the entropy of Schwarzschild black 
holes and the successful derivation of the entropy for a supersymmetric black hole. The 
applications of string theory to strong interactions are studied in Chapter 23. After a dis- 
cussion of Regge trajectories and the quark-antiquark potential, the subject turns to the 
AdS/CFT correspondence. The correspondence is discussed in some detail, with emphasis 
on the geometry of AdS spaces. A section on the quark-gluon plasma is included. 

Chapter 24 gives an introduction to the Lorentz covariant quantization of strings. It also 
introduces the Polyakov string action. The last two chapters in the book, Chapters 25 
and 26, examine string interactions. We learn that the string diagrams which represent 
the processes of string interactions are Riemann surfaces. These two chapters assume a lit- 
tle familiarity with complex variables and have a mathematical flavor. One important goal 
here is to provide insight into the absence of ultraviolet divergences in string theory, the 
fact that made string theory the first candidate for a theory of quantum gravity. 

In this book I have tried to emphasize the connections with ideas that students have learned 
before. The quantization of strings is described as the quantization of an infinite number 
of oscillators. String charge is visualized as a Maxwell current. The effects of Wilson lines 
on circles are compared with the Bohm-Aharonov effect. The modulus of an annulus is 
related to the capacitance of a cylindrical conductor, and so forth and so on. The treatment 
of topics is generally explicit and detailed, with formalism kept to a minimum. 

The choice was made to use the light-cone gauge to quantize the strings. This approach 
to quantization can be understood in full detail by students with some prior exposure to 
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quantum mechanics. The same is not true for the Lorentz covariant quantization of strings, 
where states of negative norms must be dealt with, the Hamiltonian vanishes, and there is 
no conventional looking Schrodinger equation. The light-cone approach suffices for most 
physical problems and, in fact, simplifies the treatment of several questions. 



This book as a textbook 



Part 1 of the book is structured tightly. Little can be omitted without hampering the under- 
standing of string quantization. The first chapter in Part II (on D-branes) is important 
for much of the later material. Many choices among the remaining chapters are possible. 
Different readers/instructors may take different routes. 

My experience suggests that the complete book can be covered in a full-year course at 
the undergraduate level. In a school with an academic year composed of three quarters, 
Part I and four chapters from Part II may be covered in two quarters. In a school with an 
academic year composed of two semesters. Part I and two chapters from Part II may be 
covered in one semester. In either case, the choice of chapters from Part II is a matter of 
taste. Chapters 21, 22, and 23 give an appreciation for current research in string theory. 
Lecturers who prefer to focus on T-duality and its implications will cover as much as 
possible from Chapters 17-20. If this book is used to teach exclusively to graduate students, 
the pace can be quickened considerably. 

An updated list of corrections can be found at http://xserver.lns.mit.edu/~zwiebach/ 
firstcourse.html. Solutions to the problems in the book are available to lecturers via 
solutions @ Cambridge . org. 



BASICS 




Here we meet string theory for the first time. We see how it fits into the historical 
development of physics, and how it aims to provide a unified description of all 
fundamental interactions. 



( 



1.1 The road to unification 



Over the course of time, the development of physics has been marked by unifications: 
events when different phenomena were recognized to be related and theories were adjusted 
to reflect such recognition. One of the most significant of these unifications occurred in the 
nineteenth century. 

For a while, electricity and magnetism had appeared to be unrelated physical phenom- 
ena. Electricity was studied first. The remarkable experiments of Henry Cavendish were 
performed in the period from 1771 to 1773. They were followed by the investigations of 
Charles Augustin de Coulomb, which were completed in 1785. These works provided a 
theory of static electricity, or electrostatics. Subsequent research into magnetism, however, 
began to reveal connections with electricity. In 1819 Hans Christian Oersted discovered 
that the electric current on a wire can deflect the needle of a compass placed nearby. 
Shortly thereafter, Jean-Baptiste Biot and Felix Savart (1820) and Andre-Marie Ampere 
(1820-1825) established the rules by which electric currents produce magnetic fields. A 
crucial step was taken by Michael Faraday (1831), who showed that changing magnetic 
fields generate electric fields. Equations that described all of these results became available, 
but they were, in fact, inconsistent. It was James Clerk Maxwell (1865) who constructed a 
consistent set of equations by adding a new term to one of the equations. Not only did this 
term remove the inconsistencies, but it also resulted in the prediction of electromagnetic 
waves. For this great insight, the equations of electromagnetism (or electrodynamics) are 
now called “Maxwell’s equations.” These equations unify electricity and magnetism into 
a consistent whole. This elegant and aesthetically pleasing unification was not optional. 
Separate theories of electricity and magnetism would be inconsistent. 

Another fundamental unification of two types of phenomena occurred in the late 1960s, 
about one-hundred years after the work of Maxwell. This unification revealed the deep 
relationship between electromagnetic forces and the forces responsible for weak interac- 
tions. To appreciate the significance of this unification it is necessary first to review the 
main developments that occurred in physics since the time of Maxwell. 
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An important change of paradigm was triggered by Albert Einstein’s special theory of 
relativity. In this theory one finds a striking conceptual unification of the separate notions of 
space and time. Different from a unification of forces, the merging of space and time into a 
spacetime continuum represented a new recognition of the nature of the arena where phys- 
ical phenomena take place. Newtonian mechanics was replaced by relativistic mechanics, 
and older ideas of absolute time were abandoned. Mass and energy were shown to be 
interchangeable. 

Another change of paradigm, perhaps an even more dramatic one, was brought forth by 
the discovery of quantum mechanics. Developed by Erwin Schrodinger, Werner Heisen- 
berg, Paul Dirac and others, quantum theory was verified to be the correct framework 
to describe microscopic phenomena. In quantum mechanics classical observables become 
operators. If two operators fail to commute, the corresponding observables cannot be mea- 
sured simultaneously. Quantum mechanics is a framework, more than a theory. It gives the 
rules by which theories must be used to extract physical predictions. 

In addition to these developments, four fundamental forces had been recognized to exist 
in nature. Let us have a brief look at them. 

One of them is the force of gravity. This force has been known since antiquity, but it was 
first described accurately by Isaac Newton. Gravity underwent a profound reformulation in 
Albert Einstein’s theory of general relativity. In this theory, the spacetime arena of special 
relativity acquires a life of its own, and gravitational forces arise from the curvature of this 
dynamical spacetime. Einstein’s general relativity is a classical theory of gravitation. It is 
not formulated as a quantum theory. 

The second fundamental force is the electromagnetic force. As we discussed above, 
the electromagnetic force is well described by Maxwell’s equations. Electromagnetism, or 
Maxwell theory, is formulated as a classical theory of electromagnetic fields. As opposed 
to Newtonian mechanics, which was modified by special relativity. Maxwell theory is fully 
consistent with special relativity. 

The third fundamental force is the weak force. This force is responsible for the process 
of nuclear beta decay, in which a neutron decays into a proton, an electron, and an anti- 
neutrino. In general, processes that involve neutrinos are mediated by weak forces. While 
nuclear beta decay had been known since the end of the nineteenth century, the recognition 
that a new force was at play did not take hold until the middle of the twentieth century. 
The strength of this force is measured by the Fermi constant. Weak interactions are much 
weaker than electromagnetic interactions. 

Finally, the fourth force is the strong force, nowadays called the color force. This force 
is at play in holding together the constituents of the neutron, the proton, the pions, and 
many other subnuclear particles. These constituents, called quarks, are held so tightly by 
the color force that they cannot be seen in isolation. 

We are now in a position to return to the subject of unification. In the late 1960s the 
Weinberg-Salam model of elect row eak interactions put together electromagnetism and the 
weak force into a unified framework. This unified model was neither dictated nor justified 
only by considerations of simplicity or elegance. It was necessary for a predictive and con- 
sistent theory of the weak interactions. The theory is initially formulated with four massless 
particles that carry the forces. A process of symmetry breaking gives mass to three of these 
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particles: the W + , the W~, and the Z°. These particles are the carriers of the weak force. 
The particle that remains massless is the photon, which is the carrier of the electromagnetic 
force. 

Maxwell’s equations, as we discussed before, are equations of classical electromag- 
netism. They do not provide a quantum theory. Physicists have discovered quantization 
methods, which can be used to turn a classical theory into a quantum theory - a theory 
that can be calculated using the principles of quantum mechanics. While classical electro- 
dynamics can be used confidently to calculate the transmission of energy in power lines 
and the radiation patterns of radio antennas, it is neither an accurate nor a correct theory 
for microscopic phenomena. Quantum electrodynamics (QED), the quantum version of 
classical electrodynamics, is required for correct computations in this arena. In QED, the 
photon appears as the quantum of the electromagnetic field. The theory of weak interac- 
tions is also a quantum theory of particles, so the correct, unified theory is the quantum 
electroweak theory. 

The quantization procedure is also successful in the case of the strong color force, and 
the resulting theory has been called quantum chromodynamics (QCD). The carriers of the 
color force are eight massless particles. These are colored gluons, and just like the quarks, 
they cannot be observed in isolation. The quarks respond to the gluons because they carry 
color. Quarks can come in three colors. 

The electroweak theory together with QCD form the Standard Model of particle physics. 
In the Standard Model there is some interplay between the electroweak sector and the 
QCD sector because some particles feel both types of forces. But there is no real and 
deep unification of the weak force and the color force. The Standard Model summarizes 
completely the present knowledge of particle physics. So, in fact, we are not certain about 
any possible further unification. 

In the Standard Model there are twelve force carriers: the eight gluons, the W + , the W~, 
the Z°, and the photon. All of these are bosons. There are also many matter particles, all of 
which are fermions. The matter particles are of two types: leptons and quarks. The leptons 
include the electron e~, the muon fi~ , the tau r~, and the associated neutrinos v e , v jL , and 
v T . We can list them as 

leptons: e~, r~, v e , v^, v r . 

Since we must include their antiparticles, this adds up to a total of twelve leptons. The 
quarks carry color charge, electric charge, and can respond to the weak force as well. 
There are six different types of quarks. Poetically called flavors, these types are: up (u), 
down (d), charm (c), strange ( s ), top (f), and bottom ( b ). We can list them as 

quarks: u,d,c,s,t,b. 

The u and d quarks, for example, carry different electric charges and respond differently 
to the weak force. Each of the six quark flavors listed above comes in three colors, so this 
gives 6x3= 18 particles. Including the antiparticles, we get a total of 36 quarks. Adding 
leptons and quarks together we have a grand total of 48 matter particles. Adding matter 
particles and force carriers together we have a total of 60 particles in the Standard Model. 

Despite the large number of particles it describes, the Standard Model is reasonably 
elegant and very powerful. As a complete theory of physics, however, it has two significant 
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shortcomings. The first one is that it does not include gravity. The second one is that it 
has about twenty parameters that cannot be calculated within its framework. Perhaps the 
simplest example of such a parameter is the dimensionless (or unit-less) ratio of the mass 
of the muon to the mass of the electron. The value of this ratio is about 207, and it must be 
put into the model by hand. 

Most physicists believe that the Standard Model is only a step towards the formulation of 
a complete theory of physics. A large number of physicists also suspect that some unifica- 
tion of the electroweak and strong forces into a Grand Unified Theory (GUT) will prove to 
be correct. At present, however, the unification of these two forces appears to be optional. 

Another attractive possibility is that a more complete version of the Standard Model 
includes supersymmetry. Supersymmetry is a symmetry that relates bosons to fermions. 
Since all matter particles are fermions and all force carriers are bosons, this remarkable 
symmetry unifies matter and forces. In a theory with supersymmetry, bosons and fermions 
appear in pairs of equal mass. The particles of the Standard Model do not have this property, 
so supersymmetry, if it exists in nature, must be spontaneously broken. Supersymme- 
try is such an appealing symmetry that many physicists believe that it will eventually be 
discovered. 

While the above extensions of the Standard Model may or may not occur, it is clear that 
the inclusion of gravity into the particle physics framework is not optional. Gravity must 
be included, with or without unification, if one is to have a complete theory. The effects of 
the gravitational force are presently quite negligible at the microscopic level, but they are 
crucial in studies of cosmology of the early universe. 

There is, however, a major problem when one attempts to incorporate gravitational 
physics into the Standard Model. The Standard Model is a quantum theory, while Einstein’s 
general relativity is a classical theory. It seems very difficult, if not altogether impossible, 
to have a consistent theory that is partly quantum and partly classical. Given the successes 
of quantum theory, it is widely believed that gravity must be turned into a quantum theory. 
The procedures of quantization, however, encounter profound difficulties in the case of 
gravity. The resulting theory of quantum gravity appears to be ill-defined. As a practical 
matter, in many circumstances one can work confidently with classical gravity coupled to 
the Standard Model. For example, this is done routinely in present-day descriptions of the 
universe. A theory of quantum gravity is necessary, however, to study physics at times very 
near to the Big Bang, and to study certain properties of black holes. Formulating a quan- 
tum theory that includes both gravity and the other forces seems fundamentally necessary. 
A unification of gravity with the other forces might be required to construct this complete 
theory. 



1.2 String theory as a unified theory of physics 



String theory is an excellent candidate for a unified theory of all forces in nature. It is also 
a rather impressive prototype of a complete theory of physics. In string theory all forces 
are truly unified in a deep and significant way. In fact, all the particles are unified. String 
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The decay a -*■ ft + y as a particle process (left) and as a string process (right). 



theory is a quantum theory, and, because it includes gravitation, it is a quantum theory 
of gravity. Viewed from this perspective, and recalling the failure of Einstein’s gravity to 
yield a quantum theory, one may conclude that in string theory all other interactions are 
necessary for the consistency of the quantum gravitational sector! While it may be difficult 
to measure the effects of quantum gravity directly, a theory of quantum gravity such as 
string theory may have testable predictions concerning the other interactions. 

Why is string theory a truly unified theory? The reason is simple and goes to the heart 
of the theory. In string theory, each particle is identified as a particular vibrational mode 
of an elementary microscopic string. A musical analogy is very apt. Just as a violin string 
can vibrate in different modes and each mode corresponds to a different sound, the modes 
of vibration of a fundamental string can be recognized as the different particles we know. 
One of the vibrational states of strings is the graviton, the quantum of the gravitational 
field. Since there is just one type of string, and all particles arise from string vibrations, 
all particles are naturally incorporated into a single theory. When we think in string theory 
of a decay process a -» ft + y, where an elementary particle a decays into particles ft 
and y, we imagine a single string vibrating in such a way that it is identified as particle 
a that breaks into two strings that vibrate in ways that identify them as particles ft and 
y (Figure 1.1). Since strings may turn out to be extremely tiny, it may be difficult to observe 
directly the string-like nature of particles. 

Are we sure that string theory is a good quantum theory of gravity? There is no complete 
certainty yet, but the evidence is very good. Indeed, the problems that occur when one tries 
to quantize Einstein’s theory do not seem to appear in string theory. 

For a theory as ambitious as string theory, a certain degree of uniqueness is clearly desir- 
able. It would be somewhat disappointing to have several consistent candidates for a theory 
of all interactions. The first sign that string theory is rather unique is that it does not have 
adjustable dimensionless parameters. As we mentioned before, the Standard Model of par- 
ticle physics has about twenty parameters that must be adjusted to some precise values. A 
theory with adjustable dimensionless parameters is not really unique. When the parameters 
are set to different values one obtains different theories with potentially different predic- 
tions. String theory has one dimensionful parameter, the string length t s . Its value can be 
roughly imagined as the typical size of strings. 

Another intriguing sign of the uniqueness of string theory is the fact that the dimen- 
sionality of spacetime is fixed. Our physical spacetime is four-dimensional, with one time 
dimension and three space dimensions. In the Standard Model this information is used 
to build the theory, it is not derived. In string theory, on the other hand, the number of 
spacetime dimensions emerges from a calculation. The answer is not four, but rather ten. 
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Some of these dimensions may hide from plain view if they curl up into a space that is 
small enough to escape detection in experiments done with low energies. If string theory 
is correct, some mechanism must ensure that the observable dimensionality of spacetime 
is four. 

The lack of adjustable dimensionless parameters is a sign of the uniqueness of string 
theory: it means that the theory cannot be deformed or changed continuously by changing 
these parameters. But there could be other theories that cannot be reached by continuous 
deformations. So how many string theories are there? 

Let us begin by noting two broad subdivisions. There are open strings and there are closed 
strings. Open strings have two endpoints, while closed strings have no endpoints. One 
can consider theories with only closed strings and theories with both open and closed 
strings. Since open strings generally can close to form closed strings, we do not consider 
theories with only open strings. The second subdivision is between bosonic string theories 
and superstring theories. Bosonic strings live in 26 dimensions, and all of their vibrations 
represent bosons. Since they lack fermions, bosonic string theories are not realistic. They 
are, however, much simpler than the superstrings, and most of the important concepts in 
string theory can be explained in the context of bosonic strings. The superstrings live in ten- 
dimensional spacetime, and their spectrum of states includes bosons and fermions. In fact, 
these two sets of particles are related by supersymmetry. Supersymmetry is therefore an 
important ingredient in string theory. All realistic models of string theory are built from 
superstrings. In all string theories the graviton appears as a vibrational mode of closed 
strings. In string theory gravity is unavoidable. 

By the mid 1980s five ten-dimensional superstring theories were known to exist. In the 
years that followed, many interrelations between these theories were found. Moreover, 
another theory was discovered by taking a certain strong coupling limit of one of the 
superstrings. This theory is eleven-dimensional and has been dubbed M-theory, for lack 
of a better name. It has now become clear that the five superstrings and M-theory are only 
facets or different limits of a single unique theory ! At present, this unique theory remains 
fairly mysterious. It is not yet clear whether or not the set of bosonic string theories is 
connected to the web of superstring theories. 

All in all, we see that string theory is a truly unified and possibly unique theory. It is a 
candidate for a unified theory of physics, a theory Albert Einstein tried to find ever since his 
discovery of general relativity. Einstein would have been surprised, or perhaps disturbed, 
by the prominent role that quantum mechanics plays in string theory. But string theory 
appears to be a worthy successor of general relativity. It is almost certain that string theory 
will give rise to a new conception of spacetime. The prominence of quantum mechanics in 
string theory would not have surprised Paul Dirac. His writings on quantization suggest that 
he felt that deep quantum theories arise from the quantization of classical physics. This is 
precisely what happens in string theory. This book will explain in detail how string theory, 
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at least in its simplest form, is nothing but the quantum mechanics of classical relativistic 
strings. 



1.3 String theory and its verification 



It should be said at the outset that, as of yet, there has been no experimental verification 
of string theory. In order to have experimental verification one needs a sharp prediction. 
It has been difficult to obtain such a prediction. String theory is still at an early stage 
of development, and it is not so easy to make predictions with a theory that is not well 
understood. Still, some interesting possibilities have emerged. 

As we mentioned earlier, superstring theory requires a ten-dimensional spacetime: one 
dimension of time and nine of space. If string theory is correct, extra spatial dimensions 
must exist, even if we have not seen them yet. Can we test the existence of these extra 
dimensions? If the extra dimensions are the size of the Planck length £p (the length scale 
associated with four-dimensional gravity), they will remain beyond direct detection, per- 
haps forever. Indeed, iy< ~ 10 -33 cm, and this distance is many orders of magnitude smaller 
than 10“ 16 cm, which is roughly the smallest distance that has been explored with particle 
accelerators. This scenario was deemed to be most likely. It was assumed that in string 
theory the length scale i s coincides with the Planck length, in which case extra dimensions 
would be of Planck length, as well. 

It turns out, however, that string theory allows extra dimensions that are as large as 
a tenth of a millimeter! Surprisingly, extra dimensions that large may have gone unde- 
tected. To make this work out, the string length l s is taken to be of the order of 10“ 18 cm. 
Moreover, our three-dimensional space emerges as a hypersurface embedded inside the 
nine-dimensional space. The hypersurface, or higher-dimensional membrane, is called a 
D-brane. D-branes are real, physical objects in string theory. In this setup, the presence 
of large extra dimensions is tested by gravitational experiments. Extra dimensions much 
larger than l p but still very small may be detected with particle accelerators. If extra dimen- 
sions are detected, this would be strong evidence for string theory. We discuss the subject 
of large extra dimensions in Chapter 3. 

A striking confirmation of string theory may result from the discovery of a cosmic string. 
Left-over from early universe processes, a cosmic string can stretch across the observable 
universe and may be detected via gravitational lensing or, more indirectly, through the 
detection of gravitational waves. No cosmic strings have been detected to date, but the 
searches have not been exhaustive and they continue. If found, a cosmic string must be 
studied in detail to confirm that it is a string from string theory and not the kind of string 
that can arise from conventional theories of particle physics. We discuss the subject of 
cosmic strings in Chapter 7. 

Another interesting possibility has to do with supersymmetry. If we start with a 
ten-dimensional superstring theory and compactify the six extra dimensions, the result- 
ing four-dimensional theory is, in many cases, supersymmetric. No unique predictions 
have emerged for the specific details of the four-dimensional theory, but supersymmetry 



10 



A brief introduction 



may be a rather generic feature. An experimental discovery of supersymmetry in future 
accelerators would suggest very strongly that string theory is on the right track. 

Leaving aside predictions of new phenomena, we must ask whether the Standard Model 
emerges from string theory. It should, since string theory is supposed to be a unified the- 
ory of all interactions, and it must therefore reduce to the Standard Model for sufficiently 
low energies. While string theory certainly has room to include all known particles and 
interactions, and this is very good news indeed, no one has yet been able to show that 
they actually emerge in fine detail. In Chapter 21 we will study some models which use 
D-branes and have an uncanny resemblance to the world as we know it. In these models the 
particle content is in fact precisely that of the Standard Model (the particles are obtained 
with zero mass, however, and it is not clear whether the process that gives them mass can 
work out correctly). Our four-dimensional world is part of the D-branes, but these D-branes 
happen to have more than three spatial directions. The additional D-brane dimensions are 
wrapped on the compact space (we will learn how to imagine such configurations!). The 
gauge bosons and the matter particles in the model arise from vibrations of open strings 
that stretch between D-branes. As we will learn, the endpoints of open strings must remain 
attached to the D-branes. If you wish, the musical analogy for strings is improved. Just as 
the strings of a violin are held stretched by pegs, the D-branes hold fixed the endpoints 
of the open strings whose lowest vibrational modes could represent the particles of the 
Standard Model! 

String theory shares with Einstein’s gravity a problematic feature. Einstein’s equations 
of gravitation admit many cosmological solutions. Each solution represents a consistent 
universe, but only one of them represents our observable universe. It is not easy to explain 
what selects the physical solution, but in cosmology this is done using arguments based 
on initial conditions, symmetry, and simplicity. The smaller the number of solutions a 
theory has, the more predictive it is. If the set of solutions is characterized by continuous 
parameters, selecting a solution is equivalent to adjusting the values of the parameters. 
In this way, a theory whose formulation requires no adjustable parameters may generate 
adjustable parameters through its solutions! It seems clear that in string theory the set of 
solutions (string models) is characterized by both discrete and continuous parameters. 

In order to reproduce the Standard Model it seems clear that the string model must 
not have continuous parameters; such parameters imply the existence of massless fields 
that have not been observed. It was not easy to find models without continuous parame- 
ters, but that became possible recently in the context of flux compactifications; models in 
which the extra dimensions are threaded by analogs of electric and magnetic fields. There 
is an extraordinary large number of such models, certainly more than lO 500 of them. There 
may be even more models that manage to avoid continuous parameters by other means. 
Physicists speak of a vast landscape of string solutions or models. 

In this light we can wonder what are the possible outcomes of the search for a realistic 
string model. One possible outcome (the worst one) is that no string model in the land- 
scape reproduces the Standard Model. This would rule out string theory. Another possible 
outcome (the best one) is that one string model reproduces the Standard Model. Moreover, 
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the model represents a well-isolated point in the landscape. The parameters of the Stan- 
dard Model are thus predicted. The landscape may be so large that a strange possibility 
emerges: many string models with almost identical properties all of which are consistent 
with the Standard Model to the accuracy that it is presently known. In this possibility there 
is a loss of predictive power. Other outcomes may be possible. 

String theorists sometimes say that string theory has already made at least one successful 
prediction : it predicted gravity! (I heard this from John Schwarz.) There is a bit of jest in 
saying so - after all, gravity is the oldest known force in nature. I believe, however, that 
there is a very substantial point to be made here. String theory is the quantum mechanics of 
a relativistic string. In no sense whatsoever is gravity put into string theory by hand. It is a 
complete surprise that gravity emerges in string theory. Indeed, none of the vibrations of the 
classical relativistic string correspond to the particle of gravity. It is a truly remarkable fact 
that we find the particle of gravity among the quantum vibrations of the relativistic string. 
You will see in detail how this happens as you progress through this book. The striking 
quantum emergence of gravitation in string theory has the full flavor of a prediction. 



1.4 Developments and outlook 



String theory has been a very stimulating and active area of research ever since Michael 
Green and John Schwarz showed in 1984 that superstrings are not afflicted with fatal 
inconsistencies that threaten similar particle theories in ten dimensions. Much progress 
has been made since then. 

String theory has provided new and powerful tools for the understanding of conven- 
tional particle physics theories, gauge theories in particular. These are the kinds of theories 
that are used to formulate the Standard Model. Close cousins of these gauge theories 
arise on string theory D-branes. We examine D-branes and the theories that arise on them 
in detail beginning in Chapter 15. A remarkable physical equivalence between a certain 
four-dimensional gauge theory and a closed superstring theory (the AdS/CFT correspon- 
dence) is discussed in Chapter 23. As we will explain, the correspondence has been used to 
understand hydrodynamical properties of the quark-gluon plasma created in the collision 
of gold nuclei at heavy ion colliders. 

String theory has also made good strides towards a statistical mechanics interpretation of 
black hole entropy. We know from the pioneering work of Jacob Bekenstein and Stephen 
Hawking that black holes have both entropy and temperature. In statistical mechanics these 
properties arise if a system can be constructed in many degenerate ways using its basic 
constituents. Such an interpretation is not available in Einstein’s gravitation, where black 
holes seem to have few, if any, constituents. In string theory, however, certain black holes 
can be built by assembling together various types of D-branes and strings in a controlled 
manner. For such black holes, the predicted Bekenstein entropy is obtained by counting 
the ways in which they can be built with their constituent D-branes and strings. In fact, the 
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class of black holes amenable to string theory analysis continues to grow. We discuss this 
important development in Chapter 22. 

String theory will be needed to study cosmology of the Very Early Universe. String 
theory may provide a concrete model for the realization of inflation - a period of dramatic 
exponential expansion that the universe is likely to have experienced at the earliest times. 
The theory of inflation suggests that our universe is a growing bubble or region inside a 
space that continues to inflate for eternity. Bubbles continue to emerge forever and some 
have remarked that every model in the landscape may be physically realized in some bub- 
ble. Inflation does not appear to be eternal in the past, so some kind of beginning seems 
necessary. The deepest mysteries of the universe seem to lie hidden in a regime where 
classical general relativity surely breaks down. String theory should allow us to peer into 
this unknown realm. Some day we may be able to understand how the universe comes into 
being, if it does, or how the universe could have existed forever in the past, if it did. 

Most likely, answering such questions will require a mastery of string theory that goes 
beyond our present abilities. String theory is in fact an unfinished theory. Much has been 
learned about it, but in reality we have no complete formulation of the theory. A compar- 
ison with Einstein’s theory is illuminating. Einstein’s equations for general relativity are 
elegant and geometrical. They embody the conceptual foundation of the theory and feel 
completely up to the task of describing gravitation. No similar equations are known for 
string theory, and the conceptual foundation of the theory remains largely unknown. String 
theory is an exciting research area because the central ideas remain to be found. 

Describing nature and formulating the theory - those remain the present-day challenges 
of string theory. If surmounted, we will have a theory of all interactions, allowing us to 
understand the fate of spacetime and the mysteries of a quantum mechanical universe. With 
such high stakes, physicists are likely to investigate string theory until definite answers are 
found. 



r 

2 Special relativity and extra dimensions 

L 



The word relativistic, as used in the term “relativistic strings,” indicates consis- 
tency with Einstein’s theory of special relativity. We review special relativity and 
introduce the light-cone frame, light-cone coordinates, and light-cone energy. We 
then turn to the idea of additional, compact space dimensions and show with an 
example from quantum mechanics that, if small, these dimensions have little 
effect at low energies. 



2.1 Units and parameters 



Units are nothing other than fixed quantities that we use for purposes of reference. 
A measurement involves finding the unit-free ratio of an observable quantity to the appro- 
priate unit. Consider, for example, the definition of a second in the international system of 
units (SI system). The SI second (s) is defined to be the duration of 9 192631 770 periods 
of the radiation emitted in the transition between the two hyperfine levels of the cesium- 
133 atom. When we measure the time elapsed between two events, we are really counting a 
unit-free, or dimensionless, number: the number that tells us how many seconds fit between 
the two events or, alternatively, how many periods of the cesium radiation fit between the 
two events. The same goes for length. The unit called the meter (m) is nowadays defined as 
the distance traveled by light in a certain fraction of a second (1/299 792458 of a second, 
to be precise). Mass introduces a third unit, the prototype kilogram (kg), kept safely in 
Sevres, France. 

When doing dimensional analysis, we denote the units of length, time, and mass by L, 
T, and M, respectively. These are called the three basic units. A force, for example, has 
units 



[E] = MLT~ 2 , (2.1) 

where [A] denotes the units of the quantity X. Equation (2.1) follows from Newton’s law 
that equates the force on an object to the product of its mass and its acceleration. The 
newton (N) is the SI unit of force, and it equals kg-m/s 2 . 

It is interesting that no additional basic units are needed to describe other quantities. 
Consider, for example, electric charge. Do we need a new unit to describe charge? Not 
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really. This is easy to see in Gaussian units. In these units, Coulomb’s law for the force |F| 
between two charges q\ and qi separated by a distance r reads 



1*71*721 

1^1 = 2 ' 
r Z 


( 2 . 2 ) 


The units of charge are fixed in terms of other units because we have a force law where 
charges appear and all other quantities have known units. The esu is the Gaussian unit of 
charge, and it is defined by stating that two charges of one esu each, placed at a distance of 
one centimeter apart, repel each other with a force of one dyne (the Gaussian unit of force, 
1CT 5 N). Thus 


esu 2 = dyne • cm 2 = 10 -5 N • (10~ 2 m ) 2 


= 1CT 9 N • m 2 . (2.3) 


It follows from this equation that 




[esu 2 ] = [ N • m 2 ] , 


(2.4) 


and, using ( 2 . 1 ), finally we get 

[esu] = M 1/2 L 3/2 T~ l . 


(2.5) 



This expresses the esu in terms of the three basic units. 

In SI units, charge is measured in coulombs (C). The situation in SI units is a little 
more intricate, but the essential point is the same. A coulomb is defined in SI units as the 
amount of charge carried by a current of one ampere (A) in one second. The ampere itself 
is defined as the amount of current that, when carried by two wires separated by a distance 
of one meter, produces a force of 2 x 10 -7 N/m. The coulomb, as opposed to the esu, is 
not expressed in terms of meters, kilograms, and seconds. Coulomb’s law in SI units is 



\F\ = 



1 \q\qi\ 



with 



1 



= 8.99 x 10 y 



N-m 2 



( 2 . 6 ) 



4jreo r 1 47T60 

Note the presence of C -2 in the definition of the constant prefactor. Since each charge 
carries one factor of C, all the factors of C cancel in the calculation of the force. Two 
charges of one coulomb each, placed one meter apart, will each experience a force of 
8.99 x 10 9 N. This fact allows you to deduce (Problem 2.1) how many esus there are in 
a coulomb. Even though we do not write coulombs in terms of other units, this is just a 
matter of convenience. Coulombs and esus are related, and esus are written in terms of the 
three basic units. 



When we speak of parameters in a theory, it is convenient to distinguish between dimen- 
sionful parameters and dimensionless parameters. Consider, for example, a theory in which 
there are three types of particles with masses m\, m 2 , and m 3 . We can think of the the- 
ory as having one dimensionful parameter, the mass m \ of the first particle, say, and two 
dimensionless parameters, the mass ratios mj/m \ and m 3 /m 1 . 

String theory is said to have no adjustable parameters. By this it is meant that no dimen- 
sionless parameter is needed to formulate string theory. String theory does, however, have 
one dimensionful parameter. That parameter is the string length £ s . This length sets the 
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scale in which the theory operates. In the early 1970s, when string theory was first being 
formulated, the theory was thought to be a theory of hadrons. Back then, the string length 
was taken to be comparable to the nuclear scale. Nowadays, we think that string theory is 
a theory of fundamental forces and interactions. Accordingly, we set the string length to be 
much smaller than the nuclear scale. 



2.2 Intervals and Lorentz transformations 



Special relativity is based on the experimental fact that the speed of light (c ~ 3 x 10 s 
m/s) is the same for all inertial observers. This fact leads to some rather surprising conclu- 
sions. Newtonian intuition about the absolute nature of time, the concept of simultaneity, 
and other familiar ideas must be revised. In comparing the coordinates of events, two 
inertial observers, henceforth called Lorentz observers, find that the appropriate coordinate 
transformations mix space and time. 

In special relativity, events are characterized by the values of four coordinates: a time 
coordinate t and three spatial coordinates x, y, and z. It is convenient to collect these four 
numbers in the form (ct, x,y,z), where the time coordinate is scaled by the speed of light 
so that all coordinates have units of length. To make the notation more uniform, we use 
indices to relabel the space and time coordinates as follows: 

x M = (x°, x 1 , x 2 , x 3 ) = (ct, x, y, z). (2.7) 



Here the superscript // takes the four values 0, 1,2, and 3. The x ! ‘ are spacetime 
coordinates. 

Consider a Lorentz frame S in which two events are represented by the coordinates x 1 ' 
and x M + Ax /L . Consider now a second Lorentz frame S', in which the same two events are 
described by the coordinates x ,;i and x ,/J + Ax 7// , respectively. In general, not only are the 
coordinates x ,L and x"‘ different, so too are the coordinate differences Ax M and Ax 7// . On 
the other hand, both observers will agree on the value of the invariant interval A.v 2 . This 
interval is defined by 

- As 2 = -(Ax 0 ) 2 + (Ax 1 ) 2 + (Ax 2 ) 2 + (Ax 3 ) 2 . (2.8) 



Note the minus sign in front of (Ax 0 ) 2 , as opposed to the plus sign appearing before 
the spacelike differences (Ax') 2 (i = 1 , 2, 3). This sign encodes the fundamental differ- 
ence between time and space coordinates. The agreement on the value of the intervals is 
expressed as 



- (Ax 0 ) 2 + (Ax 1 ) 2 + (Ax 2 ) 2 + (Ax 3 ) 2 = -(Ax' 0 ) 2 + (Ax' 1 ) 2 + (Ax' 2 ) 2 + (Ax' 3 ) 2 , 

(2.9) 

or, in brief: 



As 2 = A s' 2 . 



(2.10) 
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The minus sign on the left-hand side of (2.8) implies that A.v 2 > 0 for events that are 
timelike separated. Timelike separated events are events for which 

(Ax 0 ) 2 > (Ax 1 ) 2 + (Ax 2 ) 2 + (Ax 3 ) 2 . (2.11) 

The history of a particle is represented in spacetime as a curve, the world-line of 
the particle. Any two events on the world-line of a particle are timelike separated, 
because no particle can move faster than light and therefore the distance light would 
have traveled in the time interval that separates the events must be larger than the 
space separation between the events. This is the content of (2.11). You at the time 
you were born and you at this moment are timelike separated events: a long time 
has passed and you have not gone that far. Events connected by the world-line of 
a photon are said to be lightlike separated. For such a pair of events, we have 
As 2 = 0, because in this case the two sides of (2.11) are identical: the spatial sepa- 
ration between the events coincides with the distance that light would have traveled 
in the time that separates the events. Two events for which As 2 < 0 are said to be 
spacelike separated. Events that are simultaneous in a Lorentz frame but occur at dif- 
ferent positions in that same frame are spacelike separated. It is because As 2 can 
be negative that it is not written as (As) 2 . For timelike separated events, however, 
we define 

As = V As 2 if As 2 > 0 (timelike interval) . (2.12) 

Many times it is useful to consider events that are infinitesimally close to each other. 
Small coordinate differences are needed to define velocities and are also useful in gen- 
eral relativity. Infinitesimal coordinate differences are written as dx 11 , and the associated 
invariant interval is written as ds 2 . Following (2.8), we have 

- ds 2 = -(dx 0 ) 2 + (dx 1 ) 2 + (dx 2 ) 2 + (dx 3 ) 2 . (2.13) 

The equality of intervals is the statement 

ds 2 = ds' 2 . (2.14) 

A very useful notation can be motivated by trying to simplify the expression for the 
invariant ds 2 . To do this, we introduce symbols that carry subscripts instead of superscripts. 
Fet us define 

dxo = —dx°, dx\ = c/x 1 , dx 2 = dx 2 , dx 2 = dx 3 . (2.15) 

The only significant change is the inclusion of a minus sign for the zeroth component. All 
together, we write 

0 12 3 

dXfj, = (dxo, dxi, dx 2 , dx$) = (— dx ,dx ,dx ,dx ). (2.16) 

Now we can rewrite ds 2 in terms of dx 1 ' and dx^\ 

—ds 2 = —(dx 0 ) 2 + (dx 1 ) 2 + (dx 2 ) 2 + (dx 3 ) 2 
= dxodx 0 + dx\dx 1 + dxidx 2 + dx$dx 3 , 



(2.17) 
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and we see that the minus sign in (2.13) is gone. The invariant interval has become 

3 

- ds 1 = dXfidx 11 . (2.18) 

M=0 

Throughout the rest of the text we will use Einstein’s summation convention. In this con- 
vention, indices repeated in a single term are to be summed over the appropriate set of 
values. We do not consider indices to be repeated when they appear in different terms. 
For example, there are no sums implied by a 11 + b ,L or a 1 ' = b 11 , but there is an implicit 
sum in a^b^. A repeated index must appear once as a subscript and once as a superscript 
and should not appear more than twice in any one term. The letter chosen for the repeated 
index is not important; thus is the same as a v b v . Because of this, repeated indices 
are sometimes called dummy indices! Using the summation convention, we can rewrite 
(2.18) as 

-ds 2 = dx /1 dx l *. (2.19) 

Just as we did for finite coordinate differences in (2.12), for infinitesimal timelike intervals 
we define the quantity 

ds = V ds 2 if ds 2 > 0 (timelike interval) . (2.20) 

We can also express the interval ds 2 using the Minkowski metric rj llv . This is done by 
writing 

— ds 2 — v , dx^dx v . (2.21) 

Equation (2.21), by itself, does not determine the metric rj IJV . If, in addition, we require 
rj^v to be symmetric under an exchange in the order of its indices, 

b /iv = bv[i > (2.22) 

then this together with (2.21) completely determines a metric called the Minkowski met- 
ric. It is reasonable to declare that t] /lv be symmetric for, as we shall now see, any 
antisymmetric component would be irrelevant. 

Any two-index object M llv can be decomposed into a symmetric part and an antisym- 
metric part: 

1 1 

Mfi V — + M v ^) + M Vj a ) . (2.23) 

The first term on the right-hand side, the symmetric part of M, is invariant under exchange 
of the indices /x and v. The second term on the right-hand side, the antisymmetric part of 
M, changes sign under exchange of the indices /x and v. If had an antisymmetric part 
(= — ! u/i ), then its contribution would drop out of the right-hand side of (2.21). We 
can see this as follows: 

H)j.v dx fl dx v — (— fi, M ) dx ll dx v — —i; [iv dx v dx l * — — dx^dx v . (2.24) 

In the first step, we used the antisymmetry of In the second step, we relabeled the 
dummy indices: the /x were changed into v and vice versa. In the third step, we switched 
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the order of the dx 1 ' and dx v factors. The result is that ^ IIV dx ll dx v is identical to minus 
itself, and therefore it vanishes. It would be useless for p //v; to have an antisymmetric part, 
so we simply declare that it has none. 

Since repeated indices are summed over, equation (2.21) means that 

— ds 2 = rjoodx°dx° + r]Q[dx°dx l + i]\o dx l dx° + i]i\dx l dx l + ■ ■ (2.25) 

Comparing with (2.17) and using (2.22), we see that 7700 = — 1 , 77 1 1 = 7722 = 7733 = 1, and 
all other components vanish. We collect these values in matrix form: 



/ 

= 

V 



-1 

0 

0 

0 



0 0 0 \ 

1 0 0 
0 1 0 
001/ 



(2.26) 



In this equation, which follows a common identification of two-index objects with matri- 
ces, we think of //, the first index in r], as the row index, and v, the second index in rj, as 
the column index. The Minkowski metric can be used to “lower indices.” Indeed, equation 
(2.15) can be rewritten as 

dxp = r ] yLV dx v . (2.27) 



If we are handed a set of quantities b 1 ' , we always define 



bjx = Viivb v . 

Given objects a 11 and b 11 , the relativistic scalar product a ■ b is defined as 



(2.28) 



a ■ b = a^b^ = rj^ v a^b v = — a°b° + a l b l + a 2 b 2 + a^b 3 . (2.29) 



Applied to (2.19), this gives —ds 2 = dx ■ dx. Note that a^b^ = a^b^. 

It is convenient to introduce the matrix inverse for ^ v . Written conventionally as if v , 
it is given by 



/ "I 
0 
0 

V 0 



0 0 0 \ 
1 0 0 
0 1 0 
001 / 



(2.30) 



You can see by inspection that this matrix is indeed the inverse of the matrix in (2.26). 
When thinking of as a matrix, as with t] llv , the first index is a row index and the 
second index is a column index. In index notation the inverse property is written as 



ri Vp dru = K , 



where the Kronecker delta 8^ is defined by 



(2.31) 



8 



1 if p. — v 

0 if fi ^ v. 



(2.32) 
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y AS 



. S' 



vt ■ 



o 



O' 



Two Lorentz frames connected by a boost. S' is boosted along the +x direction of 5 with 
boost parameter /? = v/c. 



Note that the repeated index p in (2.31) produces the desired matrix multiplication. The 
Kronecker delta can be thought of as the index representation of the identity matrix. The 
metric with upper indices can be used to “raise indices.” Using (2.28) and (2.31), we get 

r) m = ti pp (V„ v b v ) = ( V pp Vuv) b" = 8 p b v = b p . (2.33) 

The lower p index of /; /i was raised by q pp to become an upper p index. The last step in 
the above calculation needs a little explanation: 8% b v = b p because as we sum over v, 8 P 
vanishes unless v — p, in which case it equals one. 



Lorentz transformations are the relations between coordinates in two different inertial 
frames. Consider a frame S and a frame S', that is moving along the positive x direction 
of the S frame with a velocity v, as shown in Figure 2.1. Assume the coordinate axes for 
both systems are parallel, and that the origins coincide at the common time t = t' = 0. We 
say that S' is boosted along the x direction with velocity parameter p = v/c. The Lorentz 
transformations in this case read 



x' = y(x — Pet), 
ct' — y(ct — Px), 

y' = y, 

z = z, 

where the Lorentz factor y is given by 

1 _ 1 

K /mT 

Using indices, and changing the order of the first two equations, we arrive at 

x'° = y(x° — Px 1 ), 
x' 1 = y(-Px° + x l ), 



(2.34) 



(2.35) 



(2.36) 
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In the above transformations, the coordinates x 2 and x 3 remain unchanged. These are the 
coordinates orthogonal to the direction of the boost. The inverse Lorentz transformations 
give the values of the x coordinates in terms of the x' coordinates. They are readily found 
by solving for the x in the above equations. The result is the same set of transformations 
with x and x' exchanged and with ft replaced by (—ft), as required by symmetry. 

The coordinates in the above equations satisfy the relation 

(x 0 ) 2 - (x 1 ) 2 - (x 2 ) 2 - (x 3 ) 2 = (x'°) 2 - (x' 1 ) 2 - (x' 2 ) 2 - (x' 3 ) 2 , (2.37) 



as you can show by direct computation. This is just the statement of invariance of the inter- 
val As 2 between two events: the first event is represented by (0, 0, 0, 0) in both S and 
S', and the second event is represented by coordinates x 11 in S and x' 1 ' in S' . By defini- 
tion, Lorentz transformations are the linear transformations of coordinates that respect the 
equality (2.37). 

In general, we write a Lorentz transformation as the linear relation 






= Lf, 



(2.38) 



where the entries L >l v are constants that define the linear transformation. For the boost in 
(2.36), we have 



[L] = L\ 



( Y ~Yft 0 0 \ 

—yft y 0 0 

0 0 10 

\ 0 0 0 1 / 



(2.39) 



In defining the matrix L as [L] = L'f, we are following the convention that the first index 
is a row index and the second index is a column index. This is why the lower index in L 
is written to the right of the upper index. 

The coefficients L ll v are constrained by equation (2.37). In index notation, this equation 
requires 



„ v a ft 

Vaft X X 



rH /v 

V/xv X X . 



(2.40) 



Using (2.38) twice on the right-hand side above gives 

flap x a x p = I 1/IV (L\ x a )(L V p xh = Vftv L^ a L v p x“ x^ . (2.41) 

Equivalently, we have the equation 

k a p x" x^ = 0 , with k a p = r)n V L' l a L v p - i] a p . (2.42) 

Since k a p x“ x^ = 0 must hold for all values of the coordinates x, we find that 



kafi + kp a — 0 , 



(2.43) 



as you should convince yourself by writing out the sums over a and ft. Since k a p is in 
fact symmetric under the exchange in the order of its indices, (2.43) implies k a p = 0. This 
means that 



fftiv L 1 a L p — r\ a p . 



(2.44) 
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Rewriting (2.44) to make it look more like matrix multiplication, we have 

L v p = rj ap . (2.45) 

The sum over the v index works well: it is a column index in i] llv and a row index in L v 
The fi index, however, is a row index in l!‘ a , while it should be a column index to match 
the row index in rj jlv . Moreover, the a index in L l ‘ a is a column index, while it is a row 
index in rt a p. This means that we should exchange the columns and rows of lJ l a , which 
is the matrix operation of transposition. Therefore equation (2.45) can be rewritten as the 
matrix equation 

l7 ii L = r] . (2.46) 

Here i] is the matrix whose entries are q llv . This neat equation is the constraint that L must 
satisfy to be a Lorentz transformation. 

An important property of Lorentz transformations can be deduced by taking the deter- 
minant of each side of equation (2.46). Since the determinant of a product is the product of 
the determinants, we get 

(det L T )(det r;)(det L) = det ij . (2.47) 

Cancelling the common factor of det r] and recalling that the operation of transposition 
does not change a determinant, we find 

(detL) 2 = 1 — * detL = ±1 . (2.48) 

You can check that det L = 1 for the boost in (2.39). Since det L never vanishes, the matrix 
L is always invertible and, consequently, all Lorentz transformations are invertible linear 
transformations. 

The set of Lorentz transformations includes boosts along each of the spatial coordinates. It 
also includes rotations of the spatial coordinates. Under a spatial rotation, the coordinates 
(x°, x 1 , x 2 , x 3 ) of a point transform into coordinates (x'°, x' 1 , x' 2 , x' 3 ), for which x° = 
x'°, because time is unaffected. Since the spatial distance from a point to the origin is 
preserved under a rotation, we have 

(x 1 ) 2 + (x 2 ) 2 + (x 3 ) 2 = (x' 1 ) 2 + (x' 2 ) 2 + (x' 3 ) 2 . (2.49) 

This, together with x° = x'°, implies that (2.37) holds. Therefore spatial rotations are 
Lorentz transformations. 

Any set of four quantities which transforms under Lorentz transformations in the same 
way as the x fL do is said to be a four- vector, or Lorentz vector. When we use index notation 
and write //' , we mean that b fl is a four-vector. Taking differentials of the linear equations 
(2.36), we see that the linear transformations that relate x’ to x also relate dx to dx. 
Therefore the differentials dx ^ define a Lorentz vector. In the spirit of index notation, a 
quantity with no free indices must be invariant under Lorentz transformations. A quantity 
has no free indices if it carries no index or if it contains only repeated indices, such as a^b^. 

A four-vector a 11 is said to be timelike if a 2 = a ■ a < 0, spacelike if a 2 > 0, and null 
if a 2 = 0. Recalling our discussion below (2.11), we see that the coordinate differences 
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between timelike-separated events define a timelike vector. Similarly, the coordinate dif- 
ferences between spacelike-separated events define a spacelike vector, and the coordinate 
differences between lightlike-separated events define a null vector. 

Quick calculation 2.1 Verify that the invariant ds 2 is indeed preserved under the Lorentz 
transformations (2.36). 

Quick calculation 2.2 Consider two Lorentz vectors a ^ and b 1 ' . Write the Lorentz trans- 
formations a ^ — »• a' 1 ' and //' — > b' 1 ' analogous to (2.36). Verify that is invariant 
under these transformations. 



2.3 Light-cone coordinates 



We now discuss a coordinate system that will be extremely useful in our study of string 
theory, the light-cone coordinate system. The quantization of the relativistic string can 
be worked out most directly using light-cone coordinates. There is a different approach 
to the quantization of the relativistic string, in which no special coordinates are used. This 
approach, called Lorentz covariant quantization, is discussed briefly in Chapter 24. Lorentz 
quantization is very elegant, but a full discussion requires a great deal of background 
material. We will use light-cone coordinates to quantize strings in this book. 

We define the two light-cone coordinates x + and x~ as two independent linear combi- 
nations of the time coordinate and a chosen spatial coordinate, conventionally taken to be 
x 1 . This is done by writing 




x 




(x°+x l ), 
(x° -x 1 ). 



(2.50) 



The coordinates x 2 and x 3 play no role in this definition. In the light-cone coordinate 
system, (x°, x 1 ) is traded for (x + , x~), but the other two coordinates x 2 , x 3 are kept. Thus, 
the complete set of light-cone coordinates is (x + , x~, x 2 , x 3 ). 

The new coordinates x + and x _ are called light-cone coordinates because the associated 
coordinate axes are the world-lines of beams of light emitted from the origin along the x 1 
axis. For a beam of light moving in the positive x 1 direction, we have x 1 = ct = x°, and 
thus x~ — 0. The line x~ — 0 is, by definition, the x + axis (Figure 2.2). For a beam of light 
moving in the negative x 1 direction, we have x 1 = — ct = — x°, and thus x + = 0. This 
corresponds to the x~ axis. The x 3 ^ axes are lines at 45° with respect to the x , x 1 axes. 

Can we think of x + , or perhaps x“, as a new time coordinate? Yes. In fact, both have 
equal right to be called a time coordinate, although neither one is a time coordinate in 
the standard sense of the word. Light-cone time is not quite the same as ordinary time. 
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A spacetime diagram with x 1 and x° represented as orthogonal axes. Shown are the 
light-cone axes x ± = 0. The curves with arrows are possible world-lines of physical particles. 



Perhaps the most familiar property of time is that it goes forward for any physical motion 
of a particle. Physical motion starting at the origin is represented in Figure 2.2 as curves that 
remain within the light-cone and whose slopes never go below 45°. For all these curves, 
both x + and x~ increase as we follow the arrows. The only subtlety is that, for special 
light rays, light-cone time will freeze! As we saw above, x + remains constant for a light 
ray in the negative x 1 direction, while x~ remains constant for a light ray in the positive 
x 1 direction. 

For definiteness, we will take x + to be the light-cone time coordinate. Accordingly, 
we will think of x~ as a spatial coordinate. Of course, these light-cone time and space 
coordinates will be somewhat strange. 

Taking differentials of (2.50), we readily find that 

2 dx + dx~ = {dx° + dx 1 ) {dx° — dx 1 ) = ( dx °) 2 — (dx 1 ) 2 . (2.51) 

It follows that the invariant interval (2.13), expressed in terms of the light-cone coordinates 
(2.50), takes the form 



- ds 2 = -2 dx + dx~ + {dx 2 ) 2 + {dx 2 ) 2 . (2.52) 



The symmetry in the definitions of x + and x _ is evident here. Notice that, if we are given 
ds 2 , solving for dx~ or for dx + does not require us to take a square root. This is a very 
important feature of light-cone coordinates, as we will see in Chapter 9. 

How do we represent (2.52) with index notation? We still need indices that run over four 
values, but this time the values will be called 



+ ,-,2,3. 



(2.53) 
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Just as we did in (2.21), we write 

— ds 2 = fj llv dx IJ 'dx v . (2.54) 

Here we have introduced a light-cone metric i) which, like the Minkowski metric, is also 
defined to be symmetric under the exchange of its indices. Expanding this equation, and 
comparing with (2.52), we find 

»)+- = n— I- = -1 . n++ = f ] — = 0. (2.55) 



In the (+, — ) subspace, the diagonal elements of the light-cone metric vanish, but the off- 
diagonal elements do not. We also find that fj does not couple the (+, — ) subspace to the 
(2, 3) subspace: 

r\ + i — fj-i = 0 , I — 2,3 . (2.56) 



The matrix representation of the light-cone metric is 



/ 0 -1 0 0 \ 
-1 0 0 0 
0 0 10 
0 0 0 1 / 



(2.57) 



The light-cone components of any Lorentz vector a 11 are defined in analogy with (2.50): 



a 



1 

V2 

1 

7 ! 



0 a° + a l ), 
(a 0 -a 1 ). 



(2.58) 



The scalar product between vectors, shown in (2.29), can be written using light-cone 
components. This time we have 



a ■ b = —a b + —a + b + a 2 b 2 + cr'b 2 = a^b v . (2.59) 



The last equality follows immediately from summing over the repeated indices and using 
(2.57). The first equality needs a small computation. In fact, it suffices to check that 

— a~b + — a + b~ = — a°b° + a l b l . (2.60) 

This is quickly done using (2.3) and the analogous equations for b ± . We can also introduce 
lower light-cone indices. Consider the expression a ■ b = a^b 11 , and expand the sum over 
the index fi using the light-cone labels: 

a ■ b — a+b + + a-b~ + ajb 2 + aj,b 3 . (2.61) 

Comparing with (2.59), we find that 

a+ = —a - , a- — —a + . (2.62) 
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When we lower or raise the zeroth index in a Lorentz frame, we get an extra sign. In light- 
cone coordinates, the indices of the first two coordinates switch and we get an extra sign. 

Since physics described using light-cone coordinates looks unusual, we must develop an 
intuition for it. To do this, we will look at an example where the calculations are simple 
but the results are surprising. 

Consider a particle moving along the x 1 axis with speed parameter 0 = v/c. At time 
t = 0, the positions x 1 , x 2 , and x 3 are all zero. Motion is nicely represented when the 
positions are expressed in terms of time: 

x\t) = vt = 0x°, x 2 (t) = X 3 (0 = 0. (2.63) 



How does this look in light-cone coordinates? Since x + is time and x 2 = x 3 = 0, we must 
simply express x~ in terms of x + . Using (2.63), we find 



x° + x 1 1 + 0 0 

= = x ' 



(2.64) 



As a result. 



_ x°-x x (1-0) 1-0 + 

x = — — = — — x = X . 

V2 V2 1 + 0 



(2.65) 



Since it relates light-cone position to light-cone time, we identify the ratio 



dx 1 — 0 

dx + 1 + 0 



( 2 . 66 ) 



as the light-cone velocity. How strange is this light-cone velocity? For light moving to the 
right (0 — 1) it equals zero. Indeed, light moving to the right has zero light-cone veloc- 
ity because x~ does not change at all. This is shown as line 1 in Figure 2.3. Suppose 
you have a particle moving to the right with high conventional velocity, so that 0 ~ 1 
(line 2 in the figure). Its light-cone velocity is then very small. A long light-cone time 
must pass for this particle to move a little in the x~ direction. Perhaps more interestingly, 
a static particle in standard coordinates (line 3) is moving quite fast in light-cone coordi- 
nates. When 0 = 0 the particle has unit light-cone speed. This light-cone speed increases 
as 0 grows negative: the numerator in (2.66) is larger than one and increasing, while the 
denominator is smaller than one and decreasing. For 0 = — 1 (line 5), the light-cone veloc- 
ity is infinite! While this seems odd, there is no clash with relativity. Light-cone velocities 
are just unusual. The light-cone is a frame in which kinematics has a nonrelativistic fla- 
vor and infinite velocities are possible. Note that light-cone coordinates were introduced 
as a change of coordinates, not as a Lorentz transformation. There is no Lorentz trans- 
formation that takes the coordinates (x°, x 1 , x 2 , x 3 ) into coordinates (x'°, x' 1 , x' 2 , x' 3 ) = 
(x+, x _ , x 2 , x 3 ). 



Quick calculation 2.3 Convince yourself that the last statement above is correct. 
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World-lines of particles with various light-cone velocities. Particle 1 has zero light-cone 
velocity. The velocities increase through that of particle 5, which is infinite. 



2.4 Relativistic energy and momentum 



In special relativity there is a basic relationship between the rest mass m of a point particle, 
its relativistic energy E, and its relativistic momentum p. This relationship is given by 

j ? 2 

— —p • p = m 2 c 2 . (2.67) 

c- 

The relativistic energy and momentum are given in terms of the rest mass and velocity by 
the following familiar relations: 

E = ymc 2 , p = ymv. (2.68) 

Quick calculation 2.4 Verify that the above E and p satisfy (2.67). 

Energy and momentum can be used to define a momentum four-vector, as we will prove 
shortly. This four-vector is 

P' L = (p°, P 1 ,P 2 , P 3 ) = (p Px, Py, Pz )• (2.69) 

Using the last two equations, we have 

p M , p j = my(c, v). (2.70) 

We use (2.28) to lower the index in p ^ : 

/V = (P0, Pi, P2, Pi) = r/^vP v = Px, Py, Pz) ■ (2-71) 

The above expressions for p 11 and /; ;l give 

E 2 

p^pii = ~(p°y + p-p = — 2 +p-P’ 



(2.72) 
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and, making use of (2.67), we have 

p‘ l p,i = —m 2 c 2 . (2.73) 

Since p ^ p p has no free index it must be a Lorentz scalar. Indeed, all Lorentz observers 
agree on the value of the rest mass of a particle. Using the relativistic scalar product 
notation, condition (2.73) reads 



2 2 2 

p = p ■ p — —m c . 



(2.74) 



A central concept in special relativity is that of proper time. Proper time is a Lorentz 
invariant measure of time. Consider a moving particle and two events along its trajectory. 
Different Lorentz observers record different values for the time interval between the two 
events. But now imagine that the moving particle is carrying a clock. The proper time 
elapsed is the time elapsed between the two events on that clock. By definition, it is an 
invariant: all observers of a particular clock must agree on the time elapsed on that clock! 

Proper time enters naturally into the calculation of invariant intervals. Consider an 
invariant interval for the motion of a particle along the x axis: 

- ds 2 = - c 2 dt 2 + dx 2 = - c 2 dt 2 (1 - p 2 ). (2.75) 



Now evaluate the interval using a Lorentz frame attached to the particle. This is a frame in 
which the particle does not move and time is recorded by the clock that is moving with the 
particle. In this frame dx — 0 and dt — dt p is the proper time elapsed. As a result, 

- ds 2 = -c 2 dt 2 . (2.76) 

We cancel the minus signs and take the square root (using (2.20)) to find 



ds = cdtp. 



(2.77) 



This shows that, for timelike intervals, ds/c is the proper time interval. Similarly, 
cancelling minus signs and taking the square root of (2.75) gives 

I 7 dt y 

ds = cdtJ 1 - p 2 — ► — = (2.78) 

v ds c 



Being a Lorentz invariant, ds can be used to construct new Lorentz vectors from old 
Lorentz vectors. For example, a velocity four-vector id' is obtained by taking the ratio 
of dx 1 ' and ds. Since dx ^ is a Lorentz vector and ds is a Lorentz scalar, the ratio is also a 
Lorentz vector: 



u 






dx^ 

ds 




diet ) dx dy dz 
ds ' ds' ds' ds 



(2.79) 



The factor of c is included to give u 1 ' the units of velocity. The components of id' can be 
simplified using the chain rule and (2.78). For example, 



dx dx dt v x y 

ds dt ds c 



(2.80) 
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Back in (2.79), we find 

= y(c, v x , v y , v z ) = y(c, v). (2.81) 

Comparing with (2.70), we see that the momentum four-vector is just mass times the 
velocity four-vector: 

p' L = mu‘ \ (2.82) 

This confirms our earlier assertion that the components of p 1 ' form a four-vector. Since 
any four-vector transforms under Lorentz transformations as the x 1 ' do, we can use (2.36) 
to find that under a boost in the x -direction the p 1 ' transform as 

— = y{ — Ppx)> 

c V c / 

p'x = Y {~P~ + Px)- ( 2 - 83 ) 

2.5 Light-cone energy and momentum 



The light-cone components p + and p of the momentum Lorentz vector are obtained 
using the rule (2.3): 



P + = ~^(p 0 + p 1 )= -P-, 
P~ = 4= (P° ~ P l ) = -P+- 

V2 



(2.84) 



Which component should be identified with light-cone energy? The naive answer would 
be p + . In any Lorentz frame, both the time and energy are the zeroth components of their 
respective four- vectors. Since light-cone time was chosen to be x + , we might conclude that 
light-cone energy should be taken to be p + . This is not appropriate, however. Light-cone 
coordinates do not transform as Lorentz ones do, so we should be careful and examine this 
question in detail. Both p ± are energy-like, since both are positive for physical particles. 
Indeed, from (2.67), and with m ^ 0, we have 



P° = — = J P ■ P + m 2 c 2 > | p | > Ip 1 1 . (2.85) 

c v 

As a result, p" ± p 1 > 0, and thus p ± > 0 . While both are plausible candidates for energy, 
the physically motivated choice turns out to be —p+, which happens to coincide with p~. 
Before we explain this choice, let us first evaluate p^ x 11 . In standard coordinates, 

p ■ x = pox 0 + p\x 1 + p 2 X 2 + P 3 X 3 . (2.86) 



In light-cone coordinates, using (2.61), 

p ■ x = p+x + + p-X~ + P2X 2 + P3X 3 . 



(2.87) 



29 



2.5 Light-cone energy and momentum 



In standard coordinates, po — —E /c appears together with the time x°. In light-cone coor- 
dinates, p + appears together with the light-cone time x + . We would therefore expect p + 
to be minus the light-cone energy. 

Why is this pairing significant? Energy and time are conjugate variables. As you learned 
in quantum mechanics, the Hamiltonian operator measures energy and generates time 
evolution. The wavefunction of a point particle with energy E and momentum p is given by 



\l/(t,x)=exp(—-(Et — p-x) S j. (2.88) 

Indeed, this wavefunction satisfies the Schrodinger equation 

9i If E 

= ( 2 - 89 ) 

Similarly, light-cone time evolution and light-cone energy E\ c should be related by 



H £ic . 

lh ITT = — V- 
dx + c 

To find the x + dependence of the wavefunction, we recognize that 

x[r(t, x) = exp( (pox 0 + p ■ x)) = exp( p ■ x ) , 
and, using (2.87), we have 

yHx) = exp^ l - (p+x + + p-x~ + p 2 x 2 + P ^)\ 
We can now return to (2.90) and evaluate: 



'"3^ = ~ p+ * 



~P+ 



E\ c 



(2.90) 



(2.91) 



(2.92) 



(2.93) 



This confirms our identification of (—p+) with light-cone energy. Since, presently, —p+ = 
p~, it is convenient to use p~ as the light-cone energy in order to eliminate the sign in the 
above equation: 



P = 




(2.94) 



Some physicists like to raise and lower + and — indices to simplify expressions involving 
light-cone quantities. While this is sometimes convenient, it can easily lead to errors. If 
you talk with a friend over the phone, and she says “. . . /;-plus times . . .,” you will have to 
ask, “plus up, or plus down?” In the rest of this book we will not lower the + or — indices. 
They will always be up, and the energy will always be p~ . 

We can check that the identification of p~ as light-cone energy fits together nicely with 
the intuition that we have developed for light-cone velocity. To this end, we confirm that a 
particle with small light-cone velocity also has small light-cone energy. Suppose we have 
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a particle moving very fast in the +X 1 direction. As discussed below (2.66), its light-cone 
velocity is very small. Since p l is very large, equation (2.67) gives 



P 



o 





m 2 c 2 

( P l ) 2 



2 2 
me 

W’ 



(2.95) 



The light-cone energy of the particle is therefore 

1 n , m 2 c 2 

As anticipated, both the light-cone velocity and the light-cone energy decrease as p l 
increases. 



2.6 Lorentz invariance with extra dimensions 



If string theory is correct, we must entertain the possibility that spacetime has more than 
four dimensions. The number of time dimensions must be kept equal to one - it seems 
very difficult, if not altogether impossible, to construct a consistent theory with more than 
one time dimension. The extra dimensions must therefore be spatial. Can we have Lorentz 
invariance in worlds with more than three spatial dimensions? Yes. Lorentz invariance is a 
concept that admits a very natural generalization to spacetimes with additional dimensions. 

We first extend the definition of the invariant interval els 2 to incorporate the additional 
space dimensions. In a world with five spatial dimensions, for example, we would write 

- ds 2 = —c 2 dt 2 + (dx 1 ) 2 + (dx 2 ) 2 + ( dx 3 ) 2 + ( dx 4 ) 2 + ( dx 5 ) 2 . (2.97) 

Lorentz transformations are then defined as the linear changes of coordinates that leave 
ds 2 invariant. This ensures that every inertial observer in the six-dimensional spacetime 
will agree on the value of the speed of light. With more dimensions, come more Lorentz 
transformations. While in four-dimensional spacetime we have boosts in the x 1 , x 2 , and 
x 3 directions, in this new world we have boosts along each of the five spatial dimensions. 
With three spatial coordinates, there are three basic spatial rotations: rotations that mix x 1 
and x 2 , those that mix x 1 and x 3 , and finally those that mix x 2 and x 3 . The equality of 
the number of boosts and the number of rotations is a special feature of four-dimensional 
spacetime. With five spatial coordinates, we have ten rotations, which is twice the number 
of boosts. 

The higher-dimensional Lorentz invariance includes the lower-dimensional one: if noth- 
ing happens along the extra dimensions, then the restrictions of lower-dimensional Lorentz 
invariance apply. This is clear from (2.97). For motion that does not involve the extra 
dimensions, dx 4 = dx 5 = 0, and the expression for ds 2 reduces to that used in four 
dimensions. 
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A one-dimensional world that repeats each 2nR. Several copies of Phil are shown. 



2.7 Compact extra dimensions 



It is possible for additional spatial dimensions to be undetected by low energy experiments 
if the dimensions are curled up into a compact space of small volume. In this section, we 
will try to understand what a compact dimension is. We will focus mainly on the case of 
one dimension. In Section 2.10 we will explain why small compact dimensions are hard to 
detect. 

Consider a one-dimensional world, an infinite line, say, and let x be a coordinate along 
this line. For each point P along the line, there is a unique real number x(P) called the 
x-coordinate of the point P. A good coordinate on this infinite line satisfies two conditions. 

• Any two distinct points P\ ^ p 2 have different coordinates: x ( P\ ) ^ x(P 2 ). 

• The assignment of coordinates to points is continuous: nearby points have nearly equal 
coordinates. 

If a choice of origin is made for this infinite line, then we can use distance from the origin 
to define a good coordinate. The coordinate assigned to each point is the distance from that 
point to the origin, with a sign depending upon which side of the origin the point lies. 

Imagine that you live in a world with one spatial dimension. Suppose you are walking 
along and notice a strange pattern: the scenery repeats each time you move a distance 
2i zR, for some value of R. If you meet your friend Phil, you see that there are Phil clones 
at distances 2it R, 4tt R, 6tc R, . . . down the line (see Figure 2.4). In fact, there are clones 
up the line, as well, with the same spacing. 

There is no way to distinguish an infinite line with such a strange property from a circle 
with circumference 2nR. Indeed, saying that this strange line is a circle explains the pecu- 
liar property - there really are no Phil clones; you meet the same Phil again and again as 
you go around the circle! 

How do we express this mathematically? We can think of the circle as the open line 
with an identification. That is, we declare that points with coordinates that differ by 2 n R 
are the same point. More precisely, two points are declared to be the same point if their 
coordinates differ by an integer number of 27 rR : 



Pi ~ Pi 



x(P\) = x{P 2 ) + 2 jtR n, n e Z. 



(2.98) 
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The interval 0 < x < 2 nR is a fundamental domain for the line with the identification 
( 2 . 99 ). The identified space is a circle of radius R. 



This is precise, but somewhat cumbersome, notation. With no risk of confusion, we can 
simply write 

x ~ x + 2nR, (2.99) 

which should be read as “identify any two points whose coordinates differ by 2 nR.” With 
such an identification, the open line becomes a circle. The identification has turned a non- 
compact dimension into a compact one. It may seem to you that a line with identifications 
is only a complicated way to think about a circle. We will see, however, that many physical 
problems become clearer when we view a compact dimension as an extended one with 
identifications. 

The interval 0 < x < 2 k R is a fundamental domain for the identification (2.99) (see 
Figure 2.5). A fundamental domain is a subset of the entire space that satisfies two 
conditions. 

1. No two points in the fundamental domain are identified. 

2. Any point in the entire space is in the fundamental domain or is related by the 
identification to some point in the fundamental domain. 

Whenever possible, as we did here, the fundamental domain is chosen to be a connected 
region. To build the space implied by the identification, we take the fundamental domain 
together with its boundary, and implement the identifications on the boundary. In our case, 
the fundamental domain together with its boundary is the segment 0 < x < 2 tcR. In this 
segment we identify the point x = 0 with the point x = 2: rR. The result is the circle. 

A circle of radius R can be represented in a two-dimensional plane as the set of points that 
are at a distance R from a point called the center of the circle. Note that the circle obtained 
above has been constructed directly, without the help of an embedding two-dimensional 
space. For our circle, there is no point, anywhere, that represents the center of the circle. 
We can still speak, figuratively, of the radius R of the circle, but in our case, the radius is 
simply the quantity which multiplied by 2 jr gives the total length of the circle. 

On the circle, the coordinate x is no longer a good coordinate. The coordinate x is now 
either multivalued or discontinuous. This is a problem with any coordinate on a circle. 
Consider using angles to assign coordinates on the unit circle (Figure 2.6). Fix a reference 
point Q on the circle, and let O denote the center of the circle. To any point P on the circle 
we assign as a coordinate the angle 9(P) = LPOQ. This angle is naturally multivalued. 
The reference point Q, for example, has 0(Q) = 0° and 9(Q) = 360°. If we force angles 
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Using the angle 6 to define a coordinate on a circle. The reference point Q is assigned zero 
angle: 0(Q) = 0. The coordinate 0 is naturally multivalued. 



to be single valued by restricting 0° < 6 < 360°, for example, then they become discon- 
tinuous. Indeed, two nearby points, Q and Q', then have very different angles: ()( Q) — 0, 
while O(Q') ~ 360°. It is easier to work with multivalued coordinates than it is to work 
with discontinuous ones. 

If we have a world with several open dimensions, then we can apply the identification 
(2.99) to one of the dimensions, while doing nothing to the others. The dimension described 
by x turns into a circle, and the other dimensions remain open. It is possible, of course, to 
make more than one dimension compact. Consider, for example, the ( x , y ) plane, subject 
to two identifications: 



x ~ x + 2n R , y ~ y + 2nR. (2.100) 

It is perhaps clearer to show both coordinates simultaneously while writing the identifica- 
tions. In that fashion, the two identifications are written as 

(x,y) ~ (;t +2tcR, y), (2.101) 

(x, v) ~ (x, y + 2 ttR). (2.102) 

The first identification implies that we can restrict our attention to 0 < x < 2 ttR, and the 
second identification implies that we can restrict our attention to 0 < y < 2it R. Thus the 
fundamental domain can be taken to be the square region 0 < x, y < 2 tt R, as shown in 
Figure 2.7. To build the space implied by the identifications, we take the fundamental 
domain together with its boundary, forming the full square 0 < x, y < 2itR, and imple- 
ment the identifications on the boundary. The vertical edges are identified because they 
correspond to points of the form (0, y) and (2;r R . y), which are identified by (2.101). The 
horizontal edges are identified because they correspond to points of the form (x, 0) and 
(x, 2jt R), which are identified by (2.102). The resulting space is called a two-dimensional 
torus. One can visualize the torus by taking the fundamental domain (with its boundary) 
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B 

A square region in the plane with identifications indicated by the dashed lines and 
arrowheads. The resulting surface is a torus. The identification of the vertical lines gives a 
cylinder, shown to the right of the square region. The cylinder, shown horizontally and 
flattened in the bottom left, must have its edges glued to form the torus. 




and gluing the vertical edges as their identification demands. The result is a cylinder, as 
shown in the top right comer of Figure 2.7 (with the gluing seam dashed). In this cylin- 
der, however, the bottom circle and the top circle must also be glued, since they are nothing 
other than the horizontal edges of the fundamental domain. To do this with paper, you must 
flatten the cylinder and then roll it up to glue the circles. The result looks like a flattened 
doughnut. With a flexible piece of garden hose, you could simply identify the two ends to 
obtain the familiar picture of a torus. 

We have seen how to compactify coordinates using identifications. Some compact spaces 
are constructed in other ways. In string theory, however, compact spaces that arise from 
identifications are particularly easy to work with. We shall focus on such spaces throughout 
this book. 

Quick calculation 2.5 Consider the plane ( x , y ) with the identification 

(x , y) ~ (x + 2tcR, y + 2 tc R ) . (2.103) 



What is the resulting space? Hint: the space is most clearly exhibited using a fundamental 
domain for which the line x + y = 0 is a boundary. 



35 



2.8 Orbifolds 



2.8 Orbifolds 



Sometimes identifications have fixed points, points that are related to themselves by the 
identification. For example, consider the real line parameterized by the coordinate x and 
subject to the identification x x. The point x = 0 is the unique fixed point of the identi- 

fication. A fundamental domain can be chosen to be the half-line x > 0 (Figure 2.8). Note 
that the boundary point x — 0 must be included in the fundamental domain. The space 
obtained by the above identification is in fact the fundamental domain x > 0. This is the 
simplest example of an orbifold, a space obtained by identifications that have fixed points. 
An orbifold is singular at the fixed points. While the half-line x > 0 is a conventional 
one-dimensional manifold for x > 0, neighborhoods of the point x — 0 fail to be typi- 
cal. This orbifold is called an R. 1 /Z 2 orbifold. Here R 1 stands for the (one-dimensional) 
real line, and Z 2 describes a basic property of the identification when it is viewed as the 
transformation x — > —x: if applied twice, it gives back the original coordinate. 

Certain two-dimensional cones can be obtained as orbifolds. Begin with the (x, y) plane 
and identify every point with the image obtained by rotation around the origin through the 
angle 2 Jt/N, where /V > 2 is an integer. A simple description of the identification makes 
use of the complex coordinate z = x + iy: 

Z ~ e~n~ z . (2.104) 

This identification does as expected: multiplication of any complex number by a phase 
e ,a (a real) rotates the complex number by the angle a. The identification is of Z ; y type: 

2 Tii 

viewed as the transformation z, if applied N times, it gives back the original 

coordinate. A point must be identified with all the N — 1 images obtained by repeated 
action of the transformation. The only fixed point of the Z,y transformation is the origin 
z — 0. A fundamental domain for (2.104), as we will explain below, is provided by the 
points z that satisfy the constraint 

0<arg (z)<^. (2.105) 

Here we recall that for z = re' f> , with r and 6 real, we have arg(z) = 6. The fundamental 
domain is shown in Figure 2.9. To the right one sees the cone, obtained by gluing the 
rays arg(z) = 0 and arg(z) = 2 n/N using the identification (2.104). The resulting cone is 
called the C/Zjv orbifold, where C denotes the complex plane, namely, the original two- 
dimensional plane equipped with a complex coordinate. The cone is singular at the apex 
z = 0 in the sense that it has concentrated curvature. 




0 



x 



The identification x ~ -x on the real line yields the half-line. This is the R 1 /Z 2 orbifold. 
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The fundamental domain for the identification z ~ e « z is shown shaded. After 
identification we obtain a cone - the C/Z w orbifold. 



Let us explain why the region defined by (2.105) is a fundamental domain. Acting 

2ni 

repeatedly on the region with the transformation z, we find N — 1 images that 

together with the region cover seamlessly the full complex plane. Since each point in the 
region has exactly N — 1 copies, all outside the region, no two points in the region are 
identified. Any point in C that is not in the region must lie in one of its N — 1 copies and 
therefore it has an image in the region. Note that our argument used the fact that N is an 
integer; for N irrational, for example, any point would have an infinite number of images. 
Our construction gives only the cones whose total angle at the apex is 27r divided by an 
integer. Cones with other angles exist, but they are not obtained as orbifolds. The case 
when N is replaced by a rational number is examined in Problem 2.7. Additional examples 
of orbifolds are considered in Problems 2.5, 2.6, and 2.10. 

Physics on spaces with generic singularities is typically complicated and sometimes even 
inconsistent. Orbifolds are spaces with tractable singularities, at least as far as strings are 
concerned. The physics of quantum strings on an orbifold, as we will see in Chapter 13, 
is completely regular because the orbifold arises from identifications applied to a non- 
singular space where the quantum string is simple. The strings on the orbifold inherit that 
simplicity. 



2.9 Quantum mechanics and the square well 



Planck’s constant h appears as the constant of proportionality relating the energy E and 
the angular frequency co of a photon: 



E — hco. 



(2.106) 
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Since w has units of inverse time, ti has units of energy times time. Energy has units of 
Ml?T~ 2 and, therefore, 

[h] = [Energy] x [Time] = ML 2 T~ [ . (2.107) 

The value of Planck’s constant is h ~ 1.055 x 10 -27 erg • s. 

The constant h appears in the basic commutation relations of quantum mechanics. The 
Schrodinger position and momentum operators satisfy 

[x,p] = ih. (2.108) 

If we have several spatial dimensions, then the commutation relations are 



[*'■ Pj 



ihS 



(2.109) 



where the Kronecker delta is defined as in (2.32): 



1 1 if i = j 
|0 if i*j. 



( 2 . 110 ) 



In three spatial dimensions, the indices i and j run from 1 to 3. The generalization of 
quantum mechanics to higher dimensions is straightforward. With d spatial dimensions, 
the indices in (2.109) simply run over the d possible values. 



To set the stage for the analysis of a small extra dimension, we review a standard quantum 
mechanics problem. Consider the time-independent Schrodinger equation 

h 2 t 

-—V 2 i/f(x) + V(x)\l/(x) = Eir(x) (2.111) 

2m 

applied to the case of a one-dimensional square-well potential of infinite height: 



jo if x € (0, a) 
joo if x £ (0, a) . 



( 2 . 112 ) 



For x (0, a), the infinite potential implies \[r(x) = 0. In particular, i/r(0) = i j/(a) — 0. 
This is just the quantum mechanics of a particle living on a segment, as shown in 
Figure 2.10. 



When x e (0, a), the Schrodinger equation becomes 

h 2 d 2 \/f 

v = Ef. 

2m dx 2 

The solutions of (2.1 13), consistent with the boundary conditions, are 



M = J- sin 

V a \ a 



k — 1 , 2, . . ., oo . 



(2.113) 



(2.114) 
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Left: the square-well potential in one dimension. Here the particle lives on a segment. 
Right: in the (x, y) plane the particle must remain within 0 < x < a. The direction y is 
identified as y ~ y + 2nR. The particle lives on a cylinder. 



The value k = 0 is not allowed since it would make the wavefunction vanish everywhere. 
By performing the differentiation indicated in (2.1 13), we see that the energy E k associated 
with the wavefunction i/^ is 



E k = 




(2.115) 



2.10 Square well with an extra dimension 



We now add an extra dimension to the square- well problem (no pun intended!). In addition 
to x, we include a dimension y that is curled up into a small circle of radius R. In other 
words, we make the identification 

(x, y) ~ (x, y + 2 ttR). (2.116) 

The original dimension x has not been changed (see Figure 2.10). Since the y direction has 
been turned into a circle of circumference 2nR, the space where the particle moves is now 
a cylinder. The cylinder has length a and circumference 2 ttR. The potential V (x, y) will 
remain given by (2.1 12) and is y-independent. 

We will see that, as long as R is small, and as long as we look only at low energies, the 
quantum mechanics of the particle on the segment is very similar to the quantum mechan- 
ics of the particle on the cylinder. The only length scale in the original problem is the size 
a of the segment, so small R means R <yCa. 



In two dimensions, the Schrodinger equation (2.1 1 1) becomes 



2 m 



d 2 x jr 3 2 i/f 

dx 2 3 y 2 



= Ex//. 



(2.117) 
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We use separation of variables in order to solve this equation. We let (x, y) — 
i j/ (x) (p (y) and find that the equation takes the form 



h z 



1 d 2 i/(x) h 1 1 d 2 <j>(y) 



= E. 



(2.118) 



2m\j/{x) dx 2 2 m <p(y) dy 2 

The x -dependent and y-dependent terms of this equation must separately be constant, and 
the solutions are of the form ^/(x, y) = tfri < (x)(pi(y), where 



fk (x) = c k sin 



/ knx 



kitx\ 



\ a J ’ 

(j)l (y) = a, sin (^-) + b i cos (^)- 



(2.119) 

(2.120) 



The physics along the x dimension is unchanged, since the wavefunction must still vanish 
at the ends of the segment. Therefore (2.119) takes the same form as (2.114) and k = 

1,2, The boundary condition for </>;(y) arises from the identification y ~ y + 2nR. 

Since y and y + 2 tt K are coordinates that represent the same point, the wavefunction must 
take the same value at these two arguments: 



<t>l(y) =<t>i{y + 2jzR). 



( 2 . 121 ) 



As opposed to (x), the function (p/(y) need not vanish for any y. As a result, the gen- 
eral periodic solution, recorded in (2.120), includes both sines and cosines. The presence 
of cosines allows a nonvanishing constant solution: for l = 0, we get <po(y) = bo. This 
solution is the key to understanding why a small extra dimension does not change the low 
energy physics very much. 

The energy eigenvalues of the faj are 



Ek.l = 



h 2 r/A:7ry 
2m L V a / 



Q 2 l 



( 2 . 122 ) 



These energies correspond to doubly degenerate states when 1^0, because in this case 
(2.120) contains two linearly independent solutions. The extra dimension has changed the 
spectrum dramatically. We will see, however, that if R <$C a, then the low-lying part of 
the spectrum is unchanged. The rest of the spectrum changes, but these changes are not 
accessible at low energies. 

Since / = 0 is permitted, the energy levels £y.<) coincide with the old energy levels 
Ek ! The new system contains all the energy levels of the old system. But it also includes 
additional energy levels. What is the lowest new energy level? To minimize the energy, 
each of the terms in (2.122) must be as low as possible. The minimum occurs when k = 1, 
since k = 0 is not allowed, and 1 = 1, since / = 0 gives us the old levels. The lowest new 
energy level is 



£i,t = 




(2.123) 



When R <£ a, the second term is much larger than the first, and 



E u 



2m 




2 



(2.124) 
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This energy is comparable to that of the level k eigenstate of the original problem (see 
(2.115)) when 



kjr 1 la 

— ~ — — »• k ~ . 

a R tc R 



(2.125) 



Since R is much smaller than a, k is a very large number. So the first new energy level 
appears at an energy far above that of the low-lying original states. We therefore conclude 
that an extra dimension can remain hidden from experiments at a particular energy level 
as long as the dimension is small enough. Once the probing energies become sufficiently 
high, the effects of an extra dimension can be observed. 

Curiously, the quantum mechanics of a string introduces new features. For an extra 
dimension much smaller than the already small string length i s , new low-lying states can 
appear! These correspond to strings that wrap around the extra dimension. They have no 
analog in the quantum mechanics of a point particle, and we will study them in detail in 
Chapter 17. In string theory, the conclusion remains true that no new low energy states arise 
from a small extra dimension, but there is a small qualification: the dimension must not be 
significantly smaller than i s . We will learn in Chapter 17 that, in string theory, the effects 
of a compact dimension of radius smaller than l s cannot be distinguished from those of 
another compact dimension with a radius larger than i s . 



Problems 



Problem 2.1 Exercises with units. 

(a) Find the relation between coulombs (C) and esus. 

(b) Explain the meaning of the unit K (degree kelvin) used for measuring temperatures, 
and explain its relation to the basic length, mass, and time units. 

(c) Construct and evaluate a dimensionless number using the charge e of the electron (as 
defined in the Gaussian system of units), ti, and c. (In Heaviside-Lorentz units, the 
Gaussian e 2 is replaced by |-.) 

Problem 2.2 Lorentz transformations for light-cone coordinates. 

Consider coordinates x^ = (x°, x 1 , x 2 , x 3 ) and the associated light-cone coordinates 

(x+, x ~ , x 2 , x 3 ). Write the following Lorentz transformations in terms of the light-cone 

coordinates. 

(a) A boost with velocity parameter /I in the x 1 direction. 

(b) A rotation with angle 9 in the x 1 , x 2 plane. 

(c) A boost with velocity parameter /I in the x 3 direction. 

Problem 2.3 Lorentz transformations, derivatives, and quantum operators. 

(a) Give the Lorentz transformations for the components a ;/ of a vector under a boost 
along the x 1 axis. 
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(b) Show that the objects transform under a boost along the x 1 axis in the same way 
as the a ^ considered in (a) do. This checks, in a particular case, that partial derivatives 
with respect to upper-index coordinates x 1 ' behave as a four-vector with lower indices, 
which is why they are written as 3^. 

(c) Show that, in quantum mechanics, the expressions for the energy and momentum in 
terms of derivatives can be written compactly as p M = jjjz- 

Problem 2.4 Lorentz transformations as matrices. 

A matrix L that satisfies (2.46) is a Lorentz transformation. Show the following. 

(a) If L\ and Li are Lorentz transformations so is the product L \ L 2 . 

(b) If L is a Lorentz transformation so is the inverse matrix L~ l . 

(c) If L is a Lorentz transformation so is the transpose matrix L 1 . 

Problem 2.5 Constructing simple orbifolds. 

(a) Consider a circle 5 1 , presented as the real line with the identification x ~ x + 2. 
Choose — 1 < x < 1 as the fundamental domain. The circle is the space — 1 < x < 1 
with the points x = ±1 identified. The orbifold S 1 /2L is defined by imposing the (so- 
called) Z 2 identification x ~ — x. Describe the action of this identification on the circle. 
Show that there are two points on the circle that are left fixed by the Z 2 action. Find a 
fundamental domain for the two identifications. Describe the orbifold S 1 /Z 2 in simple 
terms. 

(b) Consider a torus T 2 , presented as the (x, y) plane with the identifications x ~ x + 2 

and y ~ y + 2. Choose — 1 < x, y < 1 as the fundamental domain. The orbifold 

T 2 /Z 2 is defined by imposing the Z 2 identification (x, y) ~ (— x, — y). Prove that 
there are four points on the torus that are left fixed by the Z 2 transformation. Show 
that the orbifold T 2 / Z 2 is topologically a two-dimensional sphere, naturally presented 
as a square pillowcase with seamed edges. 

Problem 2.6 Constructing the T 2 / Z 3 orbifold. 

Consider the complex plane z = x + iy subject to the following two identifications 
z ~ T\(z) = z + 1 , and z ~ T 2 (z) = z + e l7t ^ . 

(a) A fundamental domain, with its boundary, is the parallelogram with corners at z = 0, 1 , 
and e ,7T / 3 . Where is the fourth corner? Make a sketch and indicate the identifications 
on the boundary. The resulting space is an oblique torus. 

(b) Consider now the additional Z 3 identification 

z ~ R(z) — e 27T, / 3 z . 

To understand how this identification acts on the oblique torus, draw the short diagonal 
that divides the torus into two equilateral triangles. Describe carefully the Z 3 action on 
each of the two triangles (recall that the action of R can be followed by arbitrary action 
with T \ , T 2 , and their inverses). 
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(c) Determine the three fixed points of the Z 3 action on the torus. Show that the orbifold 
r 2 /Z 3 is topologically a two-dimensional sphere, naturally presented as a triangular 
pillowcase with seamed edges and comers at the fixed points. 

Problem 2.7 A more general construction for cones? 

Consider the (x, y) plane and the complex coordinate z = x + iy. We have seen that the 

2i ri 

identification z ~ e » z, with N an integer greater than two, can be used to construct a 
cone. 



Examine now the identification 

z~e 23,i *z, N>M> 2, 



where M and N are relatively prime integers (their greatest common divisor is one). One 
may guess that a fundamental domain is provided by the points z that satisfy 0 < arg(z) < 
27r ^ . Play with low values of M and N to convince yourself that this is not true. Determine 
a fundamental domain for the identification. [Hint: use the following result. Given two 
relatively prime integers a and b, there exist integers m and n such that mu + nb = 1 . 
Finding m and n is not easy unless you use Euclid’s algorithm. Try to find, for example, 
integers m and n that satisfy 1 87m + 35 n = 1.] 

Problem 2.8 Spacetime diagrams and Lorentz transformations. 

Consider a spacetime diagram in which the x° and x 1 axes of a Lorentz frame S are rep- 
resented as vertical and horizontal axes, respectively. Show that the x'° and x' 1 axes of 
the Lorentz frame S', related to S via (2.36), appear in the original spacetime diagram as 
oblique axes. Find the angle between the primed and unprimed axes. Show in detail how 
the axes appear when fi > 0 and when /3 < 0, indicating in both cases the directions of 
increasing values of the coordinates. 

Problem 2.9 Lightlike compactification. 



The identification x ~ x + 2 tc R, is the statement that the coordinate x has been compact- 
ified into a circle of radius R. In this identification, the time dimension is left untouched. 
Consider now the strange “lightlike” compactification, in which we identify events with 
position and time coordinates related by 




( 1 ) 



(a) Rewrite this identification using light-cone coordinates. 

(b) Consider coordinates (ct ' , x ) related to (ct, x) by a boost with velocity parameter fi. 
Express the identifications in terms of the primed coordinates. 



To interpret (1) physically, consider the family of identifications 





+ 2n 



V**+*?\ 



(2) 



where R s is a length that will eventually be taken to zero, in which case (2) reduces to ( 1). 
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(c) Show that there is a boosted frame S' in which the identification (2) becomes a standard 



identification (i.e., the space coordinate is identified but the time coordinate is not). 
Find the velocity parameter of S' with respect to S and the compactihcation radius in 
this Lorentz frame S'. 

(d) Represent your answer to part (c) in a spacetime diagram. Show two points related by 
the identification (2) and the space and time axes for the Lorentz frame S' in which the 
compactihcation is standard. 

(e) Fill in the blanks in the following statement: Lightlike compactihcation with radius 
R arises by boosting a standard compactihcation with radius . . . with Lorentz factor 
y ~ R/ . . ., in the limit as 0. 

Problem 2.10 A spacetime orbifold in two dimensions. 

Consider a two-dimensional world with coordinates x° and x 1 . A boost with velocity 
parameter /3 along the x 1 axis is described by the hrst two equations in (2.36). We want to 
understand the two-dimensional space that emerges if we identify 



We are identifying spacetime points whose coordinates are related by a boost! 

(a) Use the result of Problem 2.2, part (a), to recast (1) as 

(x + , x _ ) ~ (e~^x + , e A x - ^, where e x = J ^ ^ . (2) 

What is the range of k? What is the orbifold hxed point? Assume now that fi > 0, and 
thus k > 0. 

(b) Draw a spacetime diagram, indicate the x + and x~ axes, and sketch the family of 
curves 



where a > 0 is a real constant that labels the various curves. Indicate which curves 
have small a and which have large a. For each value of a, equation (3) describes two 
disconnected curves. Show that the identification (2) relates points on each separate 
curve. 

(c) Use the expression — < is 2 = —2 dx + dx~ for the interval to show that any curve in (3) 
is spacelike. 

(d) Consider the two curves x + x _ = a 2 for some fixed a. The identification (2) makes 
each one of these curves into a circle. Find the invariant circumference of this circle 
by integrating the appropriate root of ds 2 between two neighboring identified points. 
Give your answer in terms of a and /.. Answer: V2 ak. 

Roughly, as time goes from minus infinity to plus infinity, the parameter a goes from 
infinity down to zero and then back to infinity. This orbifold represents a universe where 
space is a circle. The circle begins large, contracts to zero size, and then expands again. This 
orbifold has one pathology: the curves x + x _ = —a 2 are actually closed timelike circles. 




( 1 ) 



+ - 2 
xx = a , 



(3) 
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Problem 2.11 Extra dimension and statistical mechanics. 

Write a double sum that represents the statistical mechanics partition function Z(ci, R) for 
the quantum mechanical system considered in Section 2.10. Note that Z(a. R) factors as 
Z(a, R) = Z(a)Z(R). 

(a) Explicitly calculate Z(a, R) in the very high temperature limit (/l = — > 0). Prove 

that this partition function coincides with the partition function of a particle in a two- 
dimensional box with sides a and 2jtR. This shows that, at high temperatures, the 
effects of the extra dimension are visible. 

(b) Assume that R a in such a way that there are temperatures that are large as far as the 
box dimension a is concerned, but small as far as the compact dimension is concerned. 
Write an inequality involving kT and other constants to express this possibility. Eval- 
uate Z(a. R) in this regime, but include the leading correction due to the small extra 
dimension. 
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Electromagnetism and gravitation in 
various dimensions 



r 



As a candidate theory of all interactions, string theory includes Maxwell elec- 
trodynamics and its nonlinear cousins, as well as gravitation. We review the 
relativistic formulation of four-dimensional electrodynamics and show how it 
facilitates the definition of electrodynamics in other dimensions. We give a brief 
description of Einstein’s gravity and use the Newtonian limit to discuss the 
relation between Planck's length and the gravitational constant in various dimen- 
sions. We study the effect of compactification on the gravitational constant and 
explain how large extra dimensions could escape detection. 



3.1 Classical electrodynamics 



Unlike Newtonian mechanics, classical electrodynamics is a relativistic theory. In fact, 
Einstein was led by considerations of electrodynamics to formulate the special theory of 
relativity. Electrodynamics has a particularly elegant formulation in which the relativistic 
character of the theory is manifest. This relativistic formulation allows a natural exten- 
sion of the theory to higher dimensions. Before we discuss the relativistic formulation we 
must review Maxwell’s equations. These equations describe the dynamics of electric and 
magnetic fields. 

Although most undergraduate and graduate courses in electrodynamics nowadays use the 
international system of units (SI units), the Heaviside-Lorentz system of units is far more 
convenient for discussions that involve relativity and extra dimensions. In this system of 
units. Maxwell’s equations take the following form: 



V x E = , 


(3.1) 


c 3 1 




V • B = 0, 


(3.2) 


V • E = p, 


(3.3) 


1 - 13 E 




V x 8 = - / f . 

c c dt 


(3.4) 



The above equations imply that E and B are measured with the same units. The first two 
equations are the source-free Maxwell equations. The second two involve sources: the 
charge density p, with units of charge per unit volume, and the current density j , with 
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units of current per unit area. The Lorentz force law, which gives the rate of change of the 
relativistic momentum of a charged particle in an electromagnetic field, takes the form 



dp 

dt 



q 




(3.5) 



Since the magnetic field B is divergenceless, it can be written as the curl of a vector, the 
well known vector potential A : 

B = V x A. (3.6) 



In electrostatics the electric field E has zero curl, and it is therefore written as (minus) the 
gradient of a scalar, the well known scalar potential <J>. In electrodynamics, as equation 
(3.1) indicates, the curl of E is not always zero. Substituting (3.6) into (3.1), we find a 
linear combination of E and of the time derivative of A that has zero curl: 

/- 1 3A\ 

Vx ( E + ;nr)= a <37) 

The object inside parentheses is set equal to — V<J>, and the electric field E can be written 
in terms of the scalar potential and the vector potential: 

1 3A 

E = V3>. (3.8) 

c 3 1 

While the potentials (<t>. A) introduced above seem to be just auxiliary quantities used 
to represent electric and magnetic fields, we learn in quantum mechanics that, in fact, the 
potentials are more fundamental than the E and B fields. The Hamiltonian that describes 
the motion of a charged particle uses the potentials, not the fields. It is therefore relevant 
to examine possible ambiguities in the definition of the potentials. As we now show, the 
potentials associated with a set of E and B fields are not unique. 

If we change A into A' = A + Ve, where e is an arbitrary function of space and time, 
the new magnetic field B' is equal to the old one: 

B' = VxA' = VxAfVxVe = B, (3.9) 



noting that the curl of a gradient is zero. The change in A would not leave E unchanged, as 
it is clear from (3.8). We can repair this, however, by having <J> change too. In fact, letting 

, 1 de 

cj> — >• (J/ = <J> 

c 3 1 ’ 

A — » A' = A + Ve, (3.10) 



neither B nor E is changed. The changes of the potentials indicated above are called gauge 
transformations and e is the gauge parameter. 



Quick calculation 3.1 Verify that E, as given in (3.8), is invariant under the gauge 
transformations (3.10). 



Two sets of potentials (<t>. A) and (O', A') that are related by gauge transformations 
are physically equivalent. It follows that physically equivalent sets of the potentials give 
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identical electric and magnetic fields. It can happen, however, that potentials (<t>, A) and 
(O', A') give the same electric and magnetic fields, but still one cannot find an e such that 
(3.10) holds. In such case, the potentials are not gauge equivalent and must be considered 
physically different, even if their E and B fields are the same! This surprising situation can 
occur in spacetimes with compact spatial dimensions and will feature in our later studies 
of D-branes (Section 18.3). It does not happen in Minkowski space. 

In the presence of compact spatial dimensions a related subtlety occurs. Given some 
E and B there may not exist potentials $ and A that satisfy (3.6) and (3.8) and are well 
defined throughout the compact part of the space. The gauge transformations come to our 
help. It is not strictly necessary to have uniquely defined potentials (<J>, A) all over the 
compact space. A set of potentials defined on patches that cover fully the compact space is 
admissible if in the regions of overlap between any two patches the corresponding poten- 
tials are related by gauge transformations. Given our statement that potentials are needed 
in quantum mechanics, we must conclude that a configuration of E and B fields that does 
not arise from admissible potentials cannot be discussed. 

By the introduction of potentials, the source-free Maxwell equations (3.1) and (3.2) are 
automatically satisfied. Equations (3.3) and (3.4) contain additional information. They are 
used to derive equations for <J> and A. 



3.2 Electromagnetism in three dimensions 



What is electromagnetism in three spacetime dimensions? One way to produce a theory 
of electromagnetism in three dimensions is to begin with the four-dimensional theory and 
eliminate one spatial coordinate. This procedure is called dimensional reduction. 

In four spacetime dimensions, both electric and magnetic fields have three spatial com- 
ponents: ( E x , E y , E z ) and ( B x , B y , B z ), respectively. It may seem likely that a reduction 
to a world without a z coordinate would require dropping the z components from the two 
fields. Surprisingly, this does not work! Maxwell’s equations and the Lorentz force law 
make it impossible. 

In order to construct a consistent three-dimensional theory, we must ensure that the 
dynamics does not depend on the z direction, the direction that we want to eliminate. If 
there is motion, it must remain restricted to the (x, y) plane. It is thus natural to require that 
no quantity should have z-dependence. This does not necessarily mean dropping quantities 
with a z index. 

The Lorentz force law (3.5) is a useful guide to the construction of the lower-dimensional 
theory. Suppose that there is no magnetic field. Then, in order to keep the z component of 
momentum equal to zero we must have E z = 0; the z component of the electric field must 
go. The case of the magnetic field is more surprising. Assume that the electric field is zero. 
If the velocity of the particle is a vector in the (x, y) plane, a component of the magnetic 
field in the plane would generate, via the cross product, a force in the z direction. On the 
other hand, a z component of the magnetic field would generate a force in the (x, y) plane! 
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We conclude that B x and B y must be set equal to zero, while we can keep B z . All in all, 



The left-over fields E x , E y , and B~ can only depend on x and y . In the three-dimensional 
world with coordinates t, x, and y, the z index of B- is not a vector index. Therefore, in 
this reduced world, B z behaves like a Lorentz scalar (more precisely, it is an object called 
a pseudo-scalar). In summary, we have a two-dimensional vector E and a scalar field B z . 



We can test the consistency of this truncation by taking a look at the x and y components 
of (3.1): 



Since the right-hand sides are set to zero by our truncation, the left-hand sides should 
vanish as well. Indeed, they do. Each term on the left-hand sides equals zero, either because 
it contains an E z , or because it contains a z derivative. You may examine the consistency 
of the remaining equations in Problem 3.3. 

While setting up three-dimensional electrodynamics was not too difficult, it is much harder 
to guess what five-dimensional electrodynamics should be. As we will see next, the mani- 
festly relativistic formulation of Maxwell’s equations immediately gives the appropriate 
generalization to other dimensions. 



In the relativistic formulation of Maxwell’s equations neither the electric field nor the mag- 
netic field becomes part of a four-vector. Rather, a four-vector is obtained by combining 
the scalar potential <t> with the vector potential A: 



E z = B x = By = 0. 



(3.11) 



3 E z d E y _ 1 3 B x 

3 y dz c dt 

3 E x 3 E z _ 1 3 B y 

dz dx c dt 



(3.12) 



3.3 Manifestly relativistic electrodynamics 




(3.13) 



The corresponding object with down indices is 




(3-14) 



From A m , we create an object known as the electromagnetic field strength F /lv : 



Ffj,v = d nA v - dyA^. 



(3.15) 



Here 3^ = jF. Equation (3.15) implies that F /lv is antisymmetric: 

Fn V = -F VI1 . 



(3.16) 
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It follows from this property that all diagonal components of F /xv vanish: 

Foo = Fn = F 22 — F 33 = 0. (3.17) 



Let us calculate a few entries in F jlv . Let i denote a spatial index, that is, an index that can 
take the values 1, 2, and 3. Making use of (3.15) and (3.8), we find 



_ 9 A; _ 

0l 3x° 3x' 

Similarly, we can calculate Fu: 



1 3 A' 30 
c 31 + dx' 



(3.18) 



F 1 2 = 3iA2 - 32Aj = d x A y - 3 y A x = B-, (3.19) 



since B — V x A. Continuing in this manner, we can compute all the entries in the matrix 
Fp V : 



(0 


-E x 


-Ey 


~E Z \ 


E x 


0 


B : 


Sy 


Ey 


-IF 


0 


By 


\E Z 


By 


-B x 


0 / 



We see that the electric and magnetic fields E and B are encoded in the field strength F IIV . 

The gauge transformations (3.10) discussed before can be nicely summarized with index 
notation as 



^ + V- (3-21) 

Here A jL and A' jx are the gauge related potentials and, as before, the gauge parameter e(x) 
is an arbitrary function of the spacetime coordinates. 



Quick calculation 3.2 Verify that the gauge transformations (3.10) are correctly summa- 
rized by (3.21). 

Since gauge transformations leave the E and B fields invariant, the field strength F /lv 
must be gauge invariant. Indeed, we readily verify that 

3 (iA'„ - dyA'^ 

d/i(A v + d v e ) — 3 y (A p + d/j.e) 

F [iv + d/id v e - d v d^e 

F[, v . (3.22) 

In the last step we noted that partial derivatives commute. 




Recall that the use of potentials to represent E and B automatically solves the source-free 
Maxwell equations (3.1) and (3.2). How are these equations written in terms of the field 
strength F^ y ? They must be written so that they hold when (3.15) holds. Consider the 
following combination of field strengths: 



Tx[iv = dxF^v + 3 fiF v x + 3 v Fxii. 



(3.23) 
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Txiiv vanishes identically on account of (3.15): 



(d nA v - d v An) + 3 m (d v Ax - 3aA„) + 3 y (3>.A^ - d^Ax) = 0, (3.24) 

using the commutativity of partial derivatives. The vanishing of Tx^v, 



3 kFp V + 3 iiFvx + dvFxn = 0 , 



(3.25) 



is a set of differential equations for the field strength. These equations are precisely 
the source-free Maxwell equations. To make this clear, first note that 7x /iy satisfies the 
antisymmetry conditions 



These two equations follow from (3.23) and the antisymmetry property F^ v = — F vl , of the 
field strength. They state that T changes sign under the transposition of any two adjacent 
indices. 

Quick calculation 3.3 Verify the equations in (3.26). 

Any object, with however many indices, that changes sign under the transposition of every 
pair of adjacent indices, will change sign under the transposition of any two indices: to 
exchange any two indices you need an odd number of transpositions of adjacent indices 
(do you see why?). An object that changes sign under the transposition of any two indices 
is said to be totally antisymmetric. Therefore T is totally antisymmetric. 

Since T is totally antisymmetric, it vanishes when any two of its indices take the same 
value. T is nonvanishing only when each of its three indices takes a different value. In 
such case, different orderings of these three fixed values will give T components that can 
differ at most by a sign. Since we are setting T to zero these various orderings do not 
give new conditions. Because we have four spacetime coordinates, selecting three differ- 
ent indices can only be done in four different ways - leaving out a different index each 
time. Thus the vanishing of T gives four nontrivial equations. These four equations are the 
three components of equation (3.1) and equation (3.2). The vanishing of 7bi2, for example, 
gives us 



This is the z component of equation (3.1). The other three choices of indices lead to the 
remaining three equations (Problem 3.2). 

How can we describe Maxwell equations (3.3) and (3.4) in our present framework? Since 
these equations have sources, we must introduce a current four- vector: 




(3.26) 




(3.27) 




(3.28) 
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where p is the charge density and j — (y , j 2 , y 3 ) is the current density. In addition, we 
raise the indices of the field tensor to obtain the field tensor with upper indices: 

F ^ (3.29) 

Quick calculation 3.4 Show that 

F n'> = - F '>n > F Oi = _p 0 .' pij _ F ..' (3 30 ) 

Equation (3.29), together with the definition of F /jLV , gives 

F ^ = ^ a r, v ^d a Ap - 3 p A a ) = V^dai^Ap) - p(r,^A a ), (3.31) 



where the constancy of the metric components was used to move them across the deriva- 
tives. It is customary to apply the rules for raising and lowering indices to partial 
derivatives, so we write 3^ = rj lla d a . As a result. 



= d^A v - d v A 11 . 

It follows from (3.30) and (3.20) that 



/ 0 


E x 


Ey 


EA 


-E x 


0 


B z 


-By 


-Ey 


~B Z 


0 


By 


\-E z 


By 


~B X 


0/ 



(3.32) 



(3.33) 



Using this equation and the current vector (3.28), we can encapsulate Maxwell’s equations 
(3.3) and (3.4) as (Problem 3.2) 



3 F^ 1 „ 




= - j 1 . 

dx v c J 


(3.34) 



In the absence of sources this equation becomes 

3„ F> JV = 0 — * d v d M A V - 3 2 A 11 = 0, (3.35) 

where we have written 3 2 = 3^3^. 

Equations (3.15) together with equations (3.34) are equivalent to Maxwell’s equations in 
four dimensions. We will take these equations to define Maxwell theory in arbitrary dimen- 
sions. In d spatial dimensions the Lorentz vector A 11 has components (<I>, A) where A is a 
fit-dimensional spatial vector. 

In three-dimensional spacetime, for example, the matrix F^ v is a 3-by-3 antisymmetric 
matrix, obtained from (3.20) by discarding the last row and the last column: 

/ o -E x —E y \ 

F flv = E x 0 B z I . 

V Ey —B z 0 ) 



(3.36) 
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This immediately reproduces the main result of Section 3.2; B x , B y , and E- are to be set 
to zero. 

Motivated by (3.33), in arbitrary dimensions we will call F 0 ' the electric field E , : 

Ei = F 0i = -F 0i . (3.37) 

The electric field is a spatial vector. Equation (3.18) implies that, in any number of 
dimensions, 

1 3A 

E= Vcb. (3.38) 

c 3 t 

The magnetic field is identified with the F lJ components of the field strength. In four- 
dimensional spacetime F ,J is a 3-by-3 antisymmetric matrix. Its three independent entries 
are the components of the magnetic field vector (see (3.33)). In dimensions other than four, 
the magnetic field is no longer a spatial vector. In three spacetime dimensions the magnetic 
field is a single-component object. In five spacetime dimensions the magnetic field has as 
many entries as a 4-by-4 antisymmetric matrix, six entries. That many components do not 
fit into a spatial vector. 

Our next goal is the determination of the electric field produced by a point charge in a 
spacetime with an arbitrary but fixed number of spatial dimensions. To this end, we must 
first learn how to calculate the volumes of higher-dimensional spheres. We turn to this 
subject now. 



3.4 An aside on spheres in higher dimensions 



Since we want to work in various numbers of dimensions we should be precise when 
speaking about spheres and their volumes. When we speak loosely we tend to confuse 
spheres and balls, at least in the precise sense in which they are defined in mathematics. 
When you say that the volume of a sphere of radius R is ^irR 2 , you should really be saying 
that this is the volume of the three-ball B 3 - the three-dimensional space enclosed by the 
two-dimensional two-sphere S 1 . In three-dimensional space R 3 with coordinates x\,X 2 , 
and X 3 , we write the three-ball as the region defined by 

B\R) : x\ + x\ + x\ < R 2 . (3.39) 

This region is enclosed by the two-sphere: 

S 2 (R) : x\ + xl + xl = R 2 . (3.40) 

The superscripts in B or S denote the dimensionality of the space in question. When we 
drop the explicit argument R, we mean that R = 1. Lower-dimensional examples are also 
familiar. B 2 is a two-dimensional disk - the region enclosed in R 2 by the one-dimensional 
unit radius circle S l . In arbitrary dimensions we define balls and spheres as subspaces 
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B d (R) : xl + x% + --- + xj < R 2 . (3.41) 

This is the region enclosed by the sphere S d ~ l (R): 

S d ~ l (R) : xl+xl + ---+x} = R 2 . (3.42) 

One last piece of terminology: to avoid confusion we will always speak of volumes. If a 
space is one-dimensional we take volume to mean length. If a space is two-dimensional we 
take volume to mean area. All higher-dimensional spaces have just volumes. The volumes 
of the one- and two-dimensional spheres are 

vol (S l (R)) = 2: xR, 

voI(S 2 (R)) = 4ttR 2 . (3.43) 

Unless you have worked with other spheres before, you probably do not know what the 
volume of S 3 is. 

Since volume has units of length to the power of the space dimension, the volume of a 
sphere of radius R is related to the volume of a sphere of unit radius by 

vol (S d ~ l (R)) = R d ~ l vol (S d ~ l ). (3.44) 



Since the radius dependence of the volume is easily recovered, it suffices to record the 
volumes of unit spheres: 

vol (S 1 ) = 2rc, 

vol (S 2 ) = 4 t r. (3.45) 



Let us now begin our calculation of the volume of the sphere S d 1 . For this purpose 
consider with coordinates x\,X 2 , ■ ■ x,j, and let r be the radial coordinate: 

r 2 = x\ + x\ H F x 2 d . (3.46) 

We will find the desired volume by evaluating in two different ways the following integral: 

Id = I dx\dx 2 ■ ■ ■ dxd e~' . (3.47) 

m d 

First we proceed directly. Using (3.46) in the exponential factor, the integral becomes a 
product of d Gaussian integrals: 



Id 



fl r dxi e -h 2 = (^) d = n d < 2 . 
i = 1 



(3.48) 



Now we proceed indirectly. We do the integral by breaking ItU / into thin spherical shells. 
Since the space of constant r is the sphere S d ~ 1 (r), the volume of a shell lying between r 
and r + dr equals the volume of S d ~ l (r) times dr. Therefore, 



-L 



Id = I drvo\(S d 1 (r))e ' = vol (S' 



r c 

S d ~ l ) I 
Jo 



drr d - l e ~ r2 



= u ol<yM) /. 



-t A-l 



dte ' 1 2 



(3.49) 
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where use was made of (3.44), and in the final step we changed the variable of integra- 
tion to t = r 2 . The last integral on the right-hand side can be expressed in terms of the 
gamma function, a very useful special function. For positive x the gamma function F (x) is 
defined by 

nOO 

r(x)= / dte~'t x -\ x > 0. (3.50) 

Jo 

Unless x > 0 the integral does not converge near t — 0. With this definition, equation 
(3.49) becomes 

7 rf = ivol(5 rf - 1 )r(f). (3.51) 

Comparing with the earlier evaluation (3.48), we get our final result: 



, | 2 7T d / 2 

vol (S d - l ) = — 7 -. 


(3.52) 


r (l) 





It now remains to calculate the value of T{d/2). Since d is an integer, we must determine 
the values of the gamma function for both integer and half-integer arguments. To find 
F(l/2) we use the definition (3.50) and let t — u 2 : 

( j\ /»oo /»oo 

-J = J dte~' r l/2 = 2 J due~ u =J~i r. (3.53) 



Similarly, 



F(l) = 




(3.54) 



For larger arguments the calculation of the gamma function is simplified using a recursion 
relation. To obtain this relation, begin with 



/•CO 

T(x + 1) = / dte~'t x , 

Jo 



x > 0 



(3.55) 



which can be rewritten as 

"CO 



/ h \ r / d \ 

r(x + l ) = ~J dt \dt e ) tX = ~] (3.56) 



The boundary terms vanish for x > 0 and we find that 



r(x + 1) = x r(x), x > o. 



(3.57) 



Using this recursion relation we find, for example. 




For integer arguments the gamma function is related to the factorial: 



(3.58) 



r(5) = 4 • r(4) = 4 • 3 • r(3) = 4 • 3 • 2 • r(2) = 4 ■ 3 • 2 • 1 • T(l) = 4!. 




55 



3.5 Electric fields in higher dimensions 



Therefore, for n e Z and n > 1 , we have 

r(n) = (n - 1)!, (3.59) 

where we recall that 0! = 1. We can now test our formula (3.52) in the familiar cases: 

vol (S 1 ) = vol (S 2-1 ) = — = 2 7T, 

r(i) 

2 3 1 2 7T 3 / 2 

vol (S 2 ) = vol (S 3_1 ) = — ^ = 4 tt, (3.60) 

r (l) 

in agreement with the known values. For the less familiar S 3 we find 

2jt 2 

vol (S 3 ) = vol (S 4-1 ) = = 2 7T 2 . (3.61) 

r(2) 

Quick calculation 3.5 Show that vol (B d ) = n dl2 / T (l + |). 

3.5 Electric fields in higher dimensions 



In this section we calculate the electric field due to a point charge in a world with d spatial 
dimensions. Flere d could be three, in which case the answer is familiar, or less than three, 
but we are particularly interested in d > 3. To do this calculation we will use the general 
version of Maxwell’s equations appropriate for an arbitrary number of spatial dimensions. 
As you may imagine, the electric field of a point charge is radial. Our calculation will give 
the radial dependence and the normalization of the electric field. With minor modifications, 
this result will also inform us about the gravitational fields of point particles in d spatial 
dimensions. 



Our computation is based on the zeroth component of equation (3.34): 

dF 0i 

dx‘ ~ P ' 

Since F° l = E, (see (3.37)), this equation is just Gauss’ law: 

V ■ E = p. 



(3.62) 



(3.63) 



Gauss’ law is valid in all dimensions! Equation (3.63) can be used to determine the electric 
field of a point charge. Let us first review how this is done in the familiar setting of three 
spatial dimensions. 

Consider a point charge q, a two-sphere S 2 (r ) of radius r centered on the charge, and 
the three-ball fi 3 (r) whose boundary is the two-sphere. We integrate both sides of equation 
(3.63) over the three-ball to find 

I d(v ol) V ■ E = f d(yol) p. 

Jb 3 Jb 3 



(3.64) 
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We use the divergence theorem on the left-hand side and note that the volume integral on 
the right-hand side gives the total charge: 




■ da = q . 



(3.65) 



Since the magnitude E(r) of E is constant over the two-sphere, we get 



vol(S 2 (r)) E(r) = q. 



The volume of the two-sphere is just its area 4 jt r 2 , so 



E(r) = 



q 

4 nr 2 



(3.66) 

(3.67) 



This is the familiar result for the electric field of a point charge in three spatial dimensions. 
The electric field magnitude falls off like 1 / r 2 . 



For dimensions higher than three, the starting point (3.63) is good, so we must ask if the 
divergence theorem also holds. It turns out that it does. We will first state the theorem in d 
spatial dimensions, and then we will give some justification for it. 

Consider a d-dimensional subspace V d of and let dV d denote the boundary of V d . 
Moreover, let £ be a vector field in M . The divergence theorem states that 

/ d(vol) V • E = Flux of E across dV d = E ■ dv. (3.68) 

Jv d J dV d 

The last right-hand side requires some explanation. At any point on dV d , the space 9 V d is 
locally approximated by the ( d — 1) -dimensional tangent hyperplane. For a small piece of 
dV d around this point, the associated vector dv is a vector orthogonal to the hyperplane, 
pointing out of the volume, and with magnitude equal to the volume of the small piece 
under consideration. Note that this explanation is in accord with your experience in K 3 , 
where dv corresponds to the area vector element da. 



Let us justify the divergence theorem for the case of four space dimensions. Following a 
strategy used in elementary textbooks, it suffices to prove the divergence theorem for a 
small hypercube - the result for general subspaces follows by breaking such spaces into 
many small hypercubes. Because it is not easy to imagine a four-dimensional hypercube, 
we might as well use a three-dimensional picture with four-dimensional labels (Figure 3.1). 
We use Cartesian coordinates x, y, z, w, and consider a cube whose faces lie on hyper- 
planes selected by the condition that one of the coordinates is constant. Let one face of 
the cube and the face opposite to it lie on hyperplanes of constant x and constant x + dx, 
respectively. The outgoing normal vectors are e x , for the face at x + dx. and (—e x ), for the 
face at x. The volume of each of these two faces equals dydzdw, where dy , dz, and dw, 
together with dx, are the lengths of the edges of the cube. For an arbitrary electric field 
E (x , y, z, w), only the x component contributes to the flux through these two faces. The 
contribution is 

9 E x 

[ E x (x + dx, y, z, w) — E x (x, y, z, w) ] dydzdw — — — dxdydzdw. (3.69) 
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An attempt at a representation of a four-dimensional hypercube. The two faces of constant 
x are shown (shaded) together with their outgoing normal vectors. 



Analogous expressions hold for the flux across the three other pairs of faces. The total net 
flux from the little cube is just 

-> /dE x dEy dE- dE w \ 

Flux of £ = ( — - + — * + — - + — - ) dxdydzdw = V • E d (vol). (3.70) 

V 3x 3 y 3 z aw / 

This result is precisely the divergence theorem (3.68) applied to an infinitesimal hypercube. 
This is what we wanted to show. 

We can now return to the computation of the electric field due to a point charge in a world 
with d spatial dimensions. Consider a point charge q, the sphere S d ~ 1 (r ) of radius r cen- 
tered on the charge (this is the sphere that surrounds the charge), and the ball B d (r) whose 
boundary is the sphere S d ~ x (r). Again, we integrate both sides of equation (3.63) over the 
ball B d (r): 

I c/(vol)V • E = I d (vol) p . (3.71) 

J B d JB d 

The volume integral on the right-hand side gives the total charge, and the divergence 
theorem (3.68) relates the left-hand side to a flux integral: 

Flux of E across S d ~\r ) = q. (3.72) 

The flux equals the magnitude of the electric field times the volume of S d ~ l (r), so 

E(r)vol(S d -'(r)) = q. (3.73) 



Making use of (3.52) we find 



E(r) = 



r (f) q 

2n d l 2 r d ~ * 



(3.74) 
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This is the value of the electric field for a point charge in a world with d spatial dimen- 
sions. For d = 3 we recover the inverse-squared dependence of the electric field. In higher 
dimensions the electric field falls off faster at large distances. For each additional spatial 
dimension we get an additional factor of 1/r in the radial dependence of the electric field. 

Quick calculation 3.6 Verify that for = 3 equation (3.74) coincides with (3.67). 

Quick calculation 3.7 The force f on a test charge q in an electric field E is F — qE. 
What are the units of charge in various dimensions? 

The electrostatic potential <J> is also of interest. For time independent fields (3.38) gives 

E = — V<J>. (3.75) 

This equation, together with Gauss’ law, gives the Poisson equation: 

V 2 <F = - p , (3.76) 

which can be used to calculate the potential due to a charge distribution. The two equations 
above hold in all dimensions using, of course, the appropriate definitions of the gradient 
and the Laplacian. 



3.6 Gravitation and Planck's length 



Einstein’s theory of general relativity is a theory of gravitation. In this very elegant the- 
ory the dynamical variables encode the geometry of spacetime. When gravitational fields 
are sufficiently weak and velocities are small, Newtonian gravitation is accurate enough, 
and one need not work with the more complex machinery of general relativity. We can 
use Newtonian gravity to understand the definition of Planck’s length in various dimen- 
sions and its relation to the gravitational constant. These are interesting issues that we will 
explain here and in the rest of the present chapter. Nevertheless, when gravitation emerges 
in string theory, it does so in the language of Einstein’s theory of general relativity. To be 
able to recognize the appearance of gravity among the quantum vibrations of the relativis- 
tic string you need a little familiarity with the language of general relativity. Here you will 
take a first look at the concepts involved in this remarkable theory. 

Most physicists do not expect general relativity to hold at truly small distances nor for 
extremely large gravitational fields. This is a realm where string theory, the first seri- 
ous candidate for a quantum theory of gravitation, is necessary. General relativity is the 
large-distance/weak-gravity limit of string theory. String theory modifies general relativity; 
it must do so to make it consistent with quantum mechanics. The conceptual frame- 
work which underlies these modifications is not clear yet. It will no doubt emerge as we 
understand string theory better in the years to come. 

The spacetime of special relativity, Minkowski spacetime, is the arena for physics in the 
absence of gravitational fields. The geometrical properties of Minkowski spacetime are 
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encoded by the metric formula (2.21), which gives the invariant interval separating two 
nearby events: 

- ds 2 = r j/xv dx fl dx v . (3.77) 

Here the Minkowski metric r] /lv is a constant metric, represented as a matrix with entries 
(— 1, 1, . . ., 1) along the diagonal. Minkowski space is said to be a flat space. In the 
presence of a gravitational field, the metric becomes dynamical. We then write 

— ds 2 = g iLV (x)dx^dx v , (3.78) 



where the constant i] llv is replaced by the metric g^ v (x). If there is a gravitational field, 
the metric is in general a nontrivial function of the spacetime coordinates. The metric g^ v 
is defined to be symmetric 

gnv(x) = gvii(x). (3.79) 

It is also customary to define g IJ - v (x) as the inverse of the g^vix) matrix: 

g'“*(x)g av (x) = 8!t. (3.80) 

For many physical phenomena gravity is very weak, and the metric g^vix) can be chosen 
to be very close to the Minkowski metric >} ln .. We then write, 

gfiv(x) = ihiv + h^vix), (3.81) 

and we view h llv (x) as a small fluctuation around the Minkowski metric. This expansion 
is done, for example, to study gravity waves. Those waves represent small “ripples” on 
top of the Minkowski metric. Einstein’s equations for the gravitational field are written in 
terms of the spacetime metric g llv (x). These equations imply that matter or energy sources 
curve the spacetime manifold. For weak gravitational fields, Einstein’s equations can be 
expanded in powers of h lll: using (3.81). In the absence of sources, the resulting linearized 
equation for h^ v is 

- d u (d ll h va + S 1 ’/?'"") + d^d v h = 0. (3.82) 

Here h flv = i)^ a r} v ^h a p and h = ri^h^y — —hoo + h\\ + I 122 + ^ 33 - Equation (3.82) is 
the gravitational analog of equation (3.35), which describes Maxwell fields in the absence 
of sources. While (3.35) is exact, (3.82) is only valid for weak gravitational fields. It is the 
linear approximation to a nonlinear equation that includes additional terms quadratic and 
higher order in h . 

The analogy with electromagnetism extends to the existence of gauge transformations. 
Einstein’s gravity has gauge transformations. They arise because the use of different sys- 
tems of coordinates yields equivalent descriptions of gravitational physics. In learning 
string theory in this book you will get to appreciate the freedom to choose coordinates 
on the surfaces generated by moving strings. In general relativity, an infinitesimal change 
of coordinates 



x* = x /i + e /i (x), 



(3.83) 
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can be viewed as an infinitesimal change of the metric g^ v and, using (3.81), as an 
infinitesimal change of the fluctuating field h 11 1! . One can show that the change is given by 

Sh^ = S 0 h^ v + 0(e, h ) , with Soh^ = + d v e^. (3.84) 

As indicated above, the infinitesimal change Sh^ is given by So h^ v plus corrections, writ- 
ten as € ■ 0(h), that are linear in e and linear in the fluctuation h itself. The invariance 
of the full nonlinear equation of motion under the gauge transformation Sh^ v requires the 
invariance of the linearized equation of motion (3.82) under SqIi ,1v . Indeed, when we vary 
the h s in (3.82) using we get terms linear in e but without an h. These terms must 

cancel out completely because all other variations will contain at least one field h : this is 
clear for the variations of (3.82) using the terms represented by e ■ O(li) and for all varia- 
tions of terms quadratic and higher order in h in the complete equation of motion. We will 
check the invariance of (3.82) under the transformation SqIi 11 '’ in Chapter 10. In Maxwell 
theory the gauge parameter has no indices, but in general relativity the gauge parameter 
has a vector index. 

As we mentioned before, Newtonian gravitation emerges from general relativity in the 
approximation of weak gravitational fields and motion with small velocities. For many pur- 
poses Newtonian gravity suffices. Starting now, and for the rest of this chapter, we will use 
Newtonian gravity to understand the definition of Planck’s length in various dimensions, 
and to investigate how gravitational constants behave when some spatial dimensions are 
curled up. The results that we will obtain hold also in the full theory of general relativity. 



Newton’s law of gravitation in four dimensions states that the force of attraction between 
two masses ni\ and m 2 separated by a distance r is given by 



?( 4 ), _ Gm \mi 
* I 1 
r z 



(3.85) 



where G denotes the four-dimensional Newton constant. It follows that the units of the 
gravitational constant G are 



[G] = [Force] 



L 2 

M 2 



ML L 2 

'Wai 2 



L 3 

MT 2 ' 



The numerical value for the constant G is determined experimentally: 



(3.86) 



G = 6.674 x 1CT 11 (3.87) 

kg • s^ 

Since [c] = L/T and [7;] = ML 2 /T , the three fundamental constants G, c, and h can be 
written as 



G = 6.674 x ltT 11 — c = 2.998 x 10 8 -, h = 1.055 x 10~ 34 kg ’ m , 
kg • s 2 s s 

(3.88) 

In the study of gravitation it is sometimes convenient to use a “Planckian” system of 
units. Since we have three basic units, those of length, time, and mass, we can find new 
units of length, time, and mass such that the three fundamental constants, G, c, and h take 
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the numerical value of one in those units. These units are called the Planck length ip, the 
Planck time fp, and the Planck mass m p, respectively. In those units 



G = 






(3.89) 



without additional numerical constants - as opposed to equation (3.88). The above 
equations allow us to solve for ip, fp, and m p in terms of G, c, and h. One readily finds 



fp = ,/ — j 1 = 1.6 1 6 x 10 33 cm, 



£ P [Gh 



tp = - = J-^- = 5.391 x 10“ 44 s, 
c V c° 



mp = ^1 — = 2.176 x 10 5 g. 
G 



(3.90) 

(3.91) 

(3.92) 



These numbers represent scales at which relativistic quantum gravity effects can be impor- 
tant. Indeed, the Planck length is an extremely small length, and the Planck time is an 
incredibly short time - the time it takes light to travel the Planck length! While Einstein’s 
gravity can be used down to relatively small distances and back to relatively early times in 
the history of the universe, a quantum gravity theory (such as string theory) is needed to 
study gravity at distances of the order of the Planck length or to investigate the universe 
when it was Planck-time old. 

There is an equivalent way to characterize the Planck length: ip is the unique length that 
can be constructed using only powers of G, c, and h. One thus sets 



lp = (G) a {cf (h)Y, 



(3.93) 



and fixes the constants a, /3, and y so that the right-hand side has units of length. 



Quick calculation 3.8 Show that this condition fixes uniquely a = y = l/2, and [ J > = 
— 3/2, thus reproducing the result in (3.90). 



It may appear that mp is not a very large mass, but it is, in fact, a spectacularly large mass 
from the viewpoint of elementary particle physics. The mass m p is roughly 1 0 1 9 times 
larger than the mass of the proton. If the fundamental theory of nature is based on the basic 
constants G, c, and h, it is then a great mystery why the masses of the elementary particles 
are so much smaller than the “obvious” mass mp that can be built from the basic constants. 
This puzzle is usually called the hierarchy problem. 



For an additional perspective on the Planck mass, consider the following question: what 
should be the mass M of the proton so that the gravitational force between two protons 
cancels the electric repulsion force between them? Equating the magnitudes of the electric 
and gravitational forces we get 



GM 2 
r 2 



4 nr 2 



GM - 



e 



2 



4jt 



(3.94) 
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It is convenient to divide both sides of the equation by he to find 

GM 2 _ e 2 _ 1 M 2 1 

he ~ Arctic ~ 737 ~ 'm 2 ~ 137’ (3 ' 9 ) 

where use was made of (3.92). We thus find M ~ m p/12, or about one-tenth of the 
Planck mass. The dimensionless ratio e 2 /(Arche) is called the fine structure constant. 
It was evaluated above using the Heaviside-Lorentz definition of electric charge where 
e = \J~Arc 4.8 x 10 -10 esu (see Problem 2.1(c)). 

Quick calculation 3.9 The mass of the electron is m e = 0.9109 x 10 -27 g, and its energy 
equivalent is m e c 2 = 0.51 10 MeV. Show that the energy equivalent of the Planck mass is 
m P c 2 = 1.221 x 10 19 GeV (1 GeV = 10 9 eV). This energy is called the Planck energy. 

3.7 Gravitational potentials 



We want to learn what happens with the gravitational constant G when we attempt to 
describe gravitation in spacetimes of other dimensionalities. To find out, we will examine 
gravitational potentials in Newtonian gravity. In this section we obtain the equation that 
relates the gravitational potential to the mass distribution in a spacetime with arbitrary but 
fixed number of spatial dimensions. In doing so we will learn how to define the relevant 
gravitational constant. This result will be used in the following section to define, in any 
dimension, the Planck length in terms of the appropriate gravitational constant. 



We introduce a gravity field g with units of force per unit mass. The definition is similar 
to that of an electric field in terms of the force on a test particle: the force on a given test 
mass m at a point where the gravity field is g is given by mg. We set g equal to minus the 
gradient of a gravitational potential V ? : 

g = -W g . (3.96) 

We will take this equation to be true in all dimensions. Equation (3.96) has content: if you 
move a particle along a closed loop in a static gravitational field, the net work that you do 
against the gravitational field is zero. 

Quick calculation 3.10 Prove the above statement. 



What are the units for the gravitational potential? Equation (3.96) gives 

[Force] [V„] [Energy] 

[g] = = ^ — * [VJ = - — . 

5 M L g M 



(3.97) 



The gravitational potential has units of energy per unit mass in any dimension. The 
gravitational potential V^ ] of a point mass in four dimensions is 



y(4) = 
8 



GM 



r 



(3.98) 
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We can use the electromagnetic analogy to find the equation satisfied by the gravitational 
potential. In electromagnetism, we found an equation for the electrostatic potential which 
holds in any dimension. This is equation (3.76): 

V 2 cl> = -p . (3.99) 

The four-dimensional scalar potential for a point charge q is 

$( 4 ) = ( 3 . 100 ) 

47rr 

and it satisfies (3.99) where p is the charge density for the point charge. It follows by 
analogy that the four-dimensional gravitational potential in (3.98) satisfies 

V 2 V^ 4) = 4jrGp m , (3.101) 

where p m is the matter density. While this equation is correct in four dimensions, a small 
modification is needed for other dimensions. Note that the left-hand side has the same units 
in any number of dimensions: the units of V g are always the same, and the Laplacian always 
divides by length-squared. The right-hand side must also have the same units in any number 
of dimensions. Since p m is mass density, it has different units in different dimensions, and, 
as a consequence, the units of G must change when the dimensions change. We therefore 
rewrite the above equation more precisely as 

v 2 y ? ( 0 ) = 4 nG (D) p m , (3.102) 

when working in /7-dimensional spacetime. The superscripts shown in parentheses denote 
the dimensionality of spacetime. In particular, we identify G* 4 -* as the four-dimensional 
Newton constant G. In general, we will use D to denote the dimensionality of spacetime, 
and d to denote the number of spatial dimensions. Clearly, D = d + 1. 

Equation (3.102) defines Newtonian gravitation in an arbitrary number of dimensions. 
Just as the electric field of a point charge does, the gravitational field of a point mass falls 
off like 1 in a world with d spatial dimensions. As a result, the force between two 
point masses separated by a distance r falls off like 1 /r d ~ x . For three spatial dimensions, 
this is the familiar inverse-squared dependence of the gravitational force. If D = 6 (a world 
with two extra dimensions) the gravitational force falls off like 1 /r 4 . 

3.8 The Planck length in various dimensions 



We define the Planck length in any dimension just as we did in four dimensions: the Planck 
length is the unique length built using only powers of the gravitational constant G [,)) , c, 
and ft. To compute the Planck length we must determine the units of G ( D 1 . This is easily 
done if we recall that the units of G {D) p m (the right-hand side of (3.102)) are the same in 
all dimensions. 

Comparing the cases of five and four dimensions, for example, 

[G (5) ] -jj = [G] -jj — ► [G (5) ] = L [G], 



(3.103) 
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The units of G (5) carry one more factor of length than the units of G. We use (3.90) to read 
the units of G in terms of units of length and units of c and h : 



[c] 3 L 2 

[G] = (3.104) 

[h] 



(3.104) 



Equation (3.103) then gives 




(3.105) 



Since the Planck length is constructed uniquely from the gravitational constant, c, and h, 
we can remove the brackets in the above equation and replace L by the five-dimensional 
Planck length €p 5) : 



Since they do not have the same units, the gravitational constants in four and five dimen- 
sions cannot be compared directly. Planck lengths, however, can be compared. If the Planck 
length is the same in four and in five dimensions, then G (5) /G = Ip; the gravitational 
constants differ by one factor of the common Planck length. 

It is not hard to generalize the above equations to D spacetime dimensions: 

Quick calculation 3.11 Show that (3.106) and (3.107) are replaced by 



3.9 Gravitational constants and compactification 



If string theory is correct, our world is really higher-dimensional. The fundamental grav- 
ity theory is then defined in the higher-dimensional world, with some value for the 
higher-dimensional Planck length. Since we observe only four dimensions, the additional 
dimensions may be curled up to form a compact space with small volume. We can then 
ask: what is the effective value of the four-dimensional Planck length? As we shall show 
here, the effective four-dimensional Planck length depends on the volume of the extra 
dimensions, as well as on the value of the higher-dimensional Planck length. 

These observations raise the possibility that the Planck length in the effectively four- 
dimensional world - the famous number equal to about 10 -33 cm - may not coincide with 




(3.106) 



Reintroducing the four-dimensional Planck length: 




(3.107) 




(3.108) 
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the fundamental Planck length in the original higher-dimensional theory. Is it possible that 
the fundamental Planck length is much bigger than the familiar, four-dimensional one? We 
will answer this question in the following section. In this section we will work out the 
effect of compactification on gravitational constants. 

How do we calculate the gravitational constant in four dimensions if we are given the grav- 
itational constant in five? First, we recognize the need to curl up one spatial dimension, 
otherwise there is no effectively four-dimensional spacetime. As we will see, the size of 
the extra dimension enters the relationship between the gravitational constants. To explore 
these questions precisely, consider a five-dimensional spacetime where one dimension 
forms a small circle of radius R. We are given G (5) and we would like to calculate G (4) . 

Let (x 1 , x 2 , x 3 ) denote three spatial dimensions of infinite extent, and x 4 denote a com- 
pactified dimension of circumference 2jt R (Figure 3.2). We place a uniform ring of total 
mass M all around the circle at x 1 = x 2 = x 3 = 0. This is a mass distribution which is 
constant along the x 4 dimension. We are interested in the gravitational potential V% 5) that 
emerges from such a mass distribution. Alternatively, we could have placed a point mass 
at some fixed x 4 , but this makes the calculations more involved (Problem 3.10). In the 
present case the gravitational potential V^ 5) does not depend on x 4 . The total mass M can 
be written as 

total mass = M = 2nRm, (3.109) 

where m is the mass per unit length. 

What is the mass density in the five-dimensional world? It is only nonzero at x 1 = x 2 = 
x 3 = 0. To represent such a mass density we use delta functions. Recall that the delta 
function <5 (x ) can be viewed as a singular function whose value is zero except for x = 0 

and such that the integral dx&(x) — 1. This integral implies that if x has units of 

length, then 8(x) has units of inverse length. Since the five-dimensional mass density is 
concentrated at x 1 = x 2 = x 3 = 0, it is reasonable to include in its formula the product 
5(x 1 )i5(x 2 )i5(x 3 ) of three delta functions. We claim that 

p (5) = mS(x l )S(x 2 )S(x 3 ). (3.110) 



M 




A world with four space dimensions, one of which, x 4 , is compactified into a circle of 
radius R. A ring of total mass M wraps around this compact dimension. 
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We first check the units. The mass density p (5) must have units of M/L 4 . This works out 
since m has units of mass per unit length, and the three delta functions supply an additional 
factor of L -3 . The ansatz in (3.1 10) could still be off by a constant dimensionless factor, 
a factor of two, for example. As a final check, we integrate p <5) over all space. The result 
should be the total mass: 



f 



dx l dx 2 dx 2 



I >2ttR 

Jo ‘ 



dx p 



4 A5) 



/ oo poo poo p2nR 

dx l 8(x l ) I dx 2 8(x 2 ) I dx 2 S(x 3 ) / dx 
-oo J — oo J — OO Jo 



= m 2 tt R . 



(3.111) 



This is indeed the total mass on account of (3.109). For the effectively four-dimensional 
observer the mass is point-like, and it is located at x 1 = x 2 — x 3 — 0. So this observer 
writes 

p (4) = M8{x l )8{x 1 ')8{x i ). (3.112) 



Note the relation 



P (5) = 



1 

2 tzR 



A 4 ) 



(3.113) 



Let us now use this information in the equations for the gravitational potential. Using 
the five-dimensional version of (3.102) and (3.113), we find 



V 2 vpV, 



x 2 , x 3 ) = 4 jtG 



(«p (5) = 4n 



G (5) 
2 ttR 



,( 4 ) 



(3.114) 



As we have noted before, Vg 5 ' ) is independent of x 4 so the Laplacian above is actually the 
four-dimensional one. Since the effective four-dimensional gravitational potential is Vg 5 \ 
the above equation takes the form of the gravitational equation in four dimensions, where 
the constant in between the 4 tt and the p (4> is the four-dimensional gravitational constant. 
We have therefore shown that 



G (5) 

2ttR 



G (S) 

—pr = 2it R = l c . 



(3.115) 



where Iq is the length of the extra compact dimension. This is what we were seeking: a 
relationship between the strength of the gravitational constants in terms of the size of the 
extra dimension. 



The generalization of (3.1 15) to the case where there is more than one extra dimension is 
straightforward. One finds that 

G (0) 

-p- = dc) D ~\ (3.116) 

where lc is the common length of each of the extra dimensions. When the various dimen- 
sions are curled up into circles of different lengths, the above right-hand side must be 
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replaced by the product of the various lengths. This product is, in fact, the volume Vq of 
the extra dimensions, so that 



G (D> 




— = Vc. 


(3.117) 


G 



3.10 Large extra dimensions 



We are now done with all the groundwork. In Section 3.8 we found the relation between 
the Planck length and the gravitational constant in any dimension. In Section 3.9 we 
determined how gravitational constants are related upon compactification. We are ready 
to find out how the fundamental Planck length in a higher-dimensional theory with 
compactification is related to the Planck length in the effectively four-dimensional theory. 



To begin with, consider a five-dimensional world with Planck length £p 5) and a single 
spatial coordinate curled up into a circle of circumference ic . How are these lengths related 
to £p? From (3.107) and (3.115) we find that 



Solving for ic, we get 



( 4 5) ) 3 = (^p ) 2 



G (5) 

~G~ 



= (€ P ) 2 Zc- 



(-c 



(4 5) ) 3 

(f P ) 2 ‘ 



(3.118) 



(3.119) 



This relation enables us to explore the possibility that the world is actually five-dimensional 
with a fundamental Planck length £p 5) that is much larger than 10 -33 cm. Of course, we 
must have ip ~ 10 -33 cm. After all, this is the four-dimensional Planck length, whose 
value is given in (3.90). 

Present-day accelerators explore physics down to distances of the order of 10“ 16 cm. If 
this distance, or a somewhat smaller one, is the fundamental length scale, we may choose 
ip ' ~ 10 -18 cm. What would ic have to be? With £p 5) ~ 10 -18 cm and ip ~ 10 -33 cm, 
equation (3.119) gives Iq ~ 10 12 cm ~ 10 7 km. This is more than twenty times the dis- 
tance from the earth to the moon. Such a large extra dimension would have been detected 
a long time ago. 

Having failed to produce a realistic scenario in five dimensions, let us try in six spacetime 
dimensions. For arbitrary I), equations (3.108) and (3.116) give 



(4 D} )° 2 = = (^p) 2 (^c) D “ 4 . 



Solving for ic we find 



(D) _J_ 

0-4 






(3.120) 



(3.121) 
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For D = 6 and i^ ] 



10 1 8 cm this formula gives 



ic 



(4 6) ) 2 

fp 



1CT 3 cm. 



(3.122) 



This is a lot more interesting! A convenient unit here is the micron |xm = 10 -6 m. One- 
tenth of a millimeter is 100 p,m. We found Iq ~ 10 pm. Could there be extra dimensions 
ten microns long? You might think that this is still too big, since even microscopes probe 
smaller distances. Moreover, as we indicated before, accelerators probe distances of the 
order of 10 -16 cm. Surprisingly, it is possible that “large extra dimensions” exist and that 
we have not observed them yet. 

The existence of additional dimensions may be confirmed by testing the force law which 
gives the gravitational attraction between two masses. For distances much larger than the 
compactification scale £c the world is effectively four-dimensional, so the dependence of 
the force between two masses on their separation must follow accurately Newton’s inverse- 
squared law. On the other hand, for distances smaller than ic , the world is effectively 
higher-dimensional, and the force law will change. A force between two masses that goes 
like 1/r 4 , where r is the separation, is consistent with the existence of two compact extra 
dimensions. 

It turns out to be very difficult to test gravity at small distances; the force of gravity is 
extremely weak and spurious electrical forces must be cancelled very precisely. Motivated 
mainly by the possible existence of large extra dimensions, physicists set out to test the 
inverse-squared law at distances smaller than one millimeter. The tabletop experiments use 
a torsion pendulum detector or, alternatively, a micromachined cantilever with a test mass 
at the free end. As of 2007, experiments have found no departure from the inverse-squared 
law down to distances of about fifty microns. This means that extra dimensions, if they 
exist, must be smaller than this distance. Compact dimensions the size of ten microns, as 
we found in (3.122), are still consistent with experiment. 

You might ask: what about forces other than gravity? Electromagnetism has been tested 
to much smaller distances, and we know that the electric force obeys an inverse-squared 
law very accurately. Rutherford scattering of alpha particles off nuclei, for example, 
confirms that the inverse-squared law holds down to 10 -11 cm. Since the separation 
dependence of the electric force would change at distances smaller than the size of the 
extra dimensions, this seems to rule out large extra dimensions. The possibility of 
large extra dimensions, however, survives in string theory, where our spatial world could 
be a three-dimensional hyperplane transverse to the extra dimensions. This hyperplane is 
called a D3-brane. A D3-brane is a D-brane with three spatial dimensions. 

Open strings have the remarkable property that their endpoints must remain attached to 
the D-branes. In many phenomenological models built in string theory, it is the fluctuations 
of open strings that give rise to the familiar leptons, quarks, and gauge fields, including the 
Maxwell gauge field. It follows that these fields are bound to the D3-brane and do not feel 
the extra dimensions. If the Maxwell field lives on the D-brane, the electric field lines of a 
charge remain on the D-brane and do not go off into the extra dimensions. The force law is 
not changed at any distance scale. Closed strings are not bound by D-branes, and therefore 
gravity, which arises from closed strings, is affected by the extra dimensions. 
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Although the Planck length £p is an important length scale in four dimensions, if there 
are large extra dimensions, the truly fundamental Planck length would be much bigger than 
the effective four-dimensional one. The possibility of large extra dimensions is slightly 
unnatural - why should the extra dimensions be much larger than the fundamental length 
scale? This is not a new problem, however, but rather the problem of a large hierarchy in 
another guise. We noted earlier that particle physics faces a puzzling hierarchy between 
the Planck mass and the masses of elementary particles. In the large-extra-dimensions 
scenario, the hierarchy is postulated to arise from extra dimensions that are much larger 
than the fundamental length scale. At any rate, the truly exciting fact is that present 
experimental constraints do not rule out large extra dimensions. The discovery of extra 
dimensions would be revolutionary. 

It is possible, of course, that extra dimensions are much larger than £p ~ 10 -33 cm but 
are still quite small. If spacetime is ten dimensional, setting D = 10 and fp 10) ~ 10 -18 cm 
in (3.121) gives Iq ~ 10 -13 cm, a distance far too small to be probed with tabletop grav- 
itational experiments. Extra dimensions this size or smaller must be searched for using 
particle accelerators. 



Problems 



Problem 3.1 Lorentz covariance for motion in electromagnetic fields. * 

The Lorentz force equation (3.5) can be written relativistically as 

dp u q dx v 

= - F (1 ) 

ds c ds 

where p jL is the four- momentum. Check explicitly that this equation reproduces (3.5) when 
p. is a spatial index. What does (1) give when // = O'? Does it make sense? Is (1) a gauge 
invariant equation? 

Problem 3.2 Maxwell equations in four dimensions. 



(a) Show explicitly that the source-free Maxwell equations emerge from T^xv — 0. 

(b) Show explicitly that the Maxwell equations with sources emerge from (3.34). 

Problem 3.3 Electromagnetism in three dimensions. 

(a) Find the reduced Maxwell equations in three dimensions by starting with Maxwell’s 
equations and the force law in four dimensions, using the ansatz (3.11), and assuming 
that no field can depend on the z direction. 

(b) Repeat the analysis of three-dimensional electromagnetism starting with the Lorentz 
covariant formulation. Take A 11 = (<T>, A 1 , A 2 ), examine the Maxwell equations 
(3.34), and the relativistic form of the force law derived in Problem 3.1. 

Problem 3.4 Electric fields and potentials of point charges. 

(a) Show that for time-independent fields, the Maxwell equation Top = 0 implies that 
3,- Ej — dj Ei = 0. Explain why this condition is satisfied by the ansatz E = —VO. 
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(b) Show that with d spatial dimensions, the potential <t> due to a point charge q is given by 



<D(r) = 



r(f-i) q 

4 Jr rf/2 r d — 2 ' 



Problem 3.5 Calculating the divergence in higher dimensions. 

Let f — f(r) r be a vector function in B/ 1 . Here r is a unit radial vector, and r is the radial 
distance to the origin. Derive a formula for V ■ / by applying the divergence theorem to 
a spherical shell of radius r and width dr. Check that for d — 3 your answer reduces to 
V-/ = /'(r) + ?/(r). 

Problem 3.6 Analytic continuation for gamma functions.^ 



Consider the definition of the gamma function for complex arguments z whose real part is 
positive: 



L 



T(z) = / dte-'t 



-t f z - 1 



9t(z) > 0. 



Use this equation to show that for SH(z) >0 

N 

r (z) = 



r 1 , N (_ f)n. N 

u n = 0 n=0 



( - 1 )” 1 
n ! z + n 



r 



dt e t 






Explain why the above right-hand side is well defined for 91(z) > —N — 1. It follows that 
this right-hand side provides the analytic continuation of L(z) for S)t(z) > —N — 1. Con- 
clude that the gamma function has poles at 0, — 1 , —2, . . ., and give the value of the residue 
at z = —n (with n a positive integer). 



Problem 3.7 Simple quantum gravity effects are small. ^ 



(a) What would be the “gravitational” Bohr radius for a hydrogen atom if the attraction 
binding the electron to the proton was gravitational? The standard Bohr radius is ao = 

~ 5.29 x 1CT 9 cm. 

mr 

(b) In “units” where G, c, and h are set equal to one, the temperature of a black hole is 
given by kT = Insert back the factors of G, c, and h into this formula. Evaluate 
the temperature of a black hole of a million solar masses. What is the mass of a black 
hole whose temperature is room temperature? 



Problem 3.8 Vacuum energy and an associated length scale. 

Observations indicate that the expansion of the universe is currently accelerating possibly 
due to a vacuum energy density. The mass density associated with this energy is approxi- 
mately p vac = 7.7 x 10 -27 kg/m 7 . Some physicists try to understand the acceleration of 
the universe by introducing modifications to gravity. It is then useful to know what length 
scales could be important. If one assumes that the only relevant parameters are p vac , h, and 
c, one can construct a length parameter l v!ic by multiplying powers: 

■^vac = Pvac ^ ■ 
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Problem 3.10: a point mass M in a five-dimensional spacetime with one compact dimension. 



What must be the values of a, /J, and y in the above equation? What is the numerical value 
of £ va c? Express your answer in |xm, where 1 |i m = 10~ 6 m. 

Problem 3.9 Planetary motion in four and higher dimensions. 

Consider the motion of planets in planar circular orbits around heavy stars in our four- 
dimensional spacetime and in spacetimes with additional spatial dimensions. We wish to 
study the stability of these orbits under perturbations that keep them planar. Such a per- 
turbation would arise, for example, if a meteorite moving on the plane of the orbit hits the 
planet and changes its angular momentum. 

Show that while planetary circular orbits in our four-dimensional world are stable under 
such perturbations, they are not so in five or higher dimensions. [Hint: you may find it 
useful to use the effective potential for motion in a central force field.] 

Problem 3.10 Gravitational field of a point mass in a compactified five-dimensional 
world. 

Consider a five-dimensional spacetime with space coordinates (x, y, z, w) not yet com- 
pactified. A point mass M is located at the origin (x, y, z, w) = (0, 0, 0, 0). 

(a) Find the gravitational potential V^ 5 \r). Write your answer in terms of M, G <5 \ 
and r = (x 2 + y 2 + z 2 + w 2 ) 1 / 2 . [Hint: use V 2 V ? <5) = 4 t rG^p m and the divergence 
theorem.] 

Now let w become a circle with radius a while keeping the mass fixed, as shown in 
Figure 3.3. 

(b) Write an exact expression for the gravitational potential V ? (5) (x, y, z, 0). This potential 
is a function of R = (x 2 + y 2 + - 2 ) 1 22 and can be written as an infinite sum. 
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(c) Show that for R a the gravitational potential takes the form of a four-dimensional 
gravitational potential, with Newton’s constant G (4) given in terms of G (5) as in 
(3.115). [Hint: turn the infinite sum into an integral.] 

These results confirm both the relation between the four- and five-dimensional New- 
ton constants in a compactification and the emergence of a four-dimensional potential at 
distances large compared to the size of the compact dimension. 

Problem 3.1 1 Exact answer for the gravitational potential. 



The infinite sum in Problem 3.10 can be evaluated exactly using the identity 



E 



Yl— — OO 



1 

1 + (mix) 2 




(a) Find an exact closed-form expression for the gravitational potential V^ 5) (x, y, z, 0) in 
the compactified theory. 

(b) Expand this answer to calculate the leading correction to the gravitational potential in 
the limit when R a. For what value of R/a is the correction of order 1%? 

(c) Use the exact answer in (a) to expand the potential when R a. Give the first two 
terms in the expansion. Do you recognize the leading term? 




A full appreciation for the subtleties of relativistic strings requires an under- 
standing of the basic physics of nonrelativistic strings. These strings have mass 
and tension. They can vibrate both transversely and longitudinally. We study 
the equations of motion for nonrelativistic strings and develop the Lagrangian 
approach to their dynamics. 



4.1 Equations of motion for transverse oscillations 



We will begin our study of strings with a look at the transverse fluctuations of a stretched 
string. The direction along the string is called the longitudinal direction, and the directions 
orthogonal to the string are called the transverse directions. We consider, for notational 
simplicity, the case when there is only one transverse direction - the generalization to 
additional transverse directions is straightforward. 

Working in the (x, y) plane, let the classical nonrelativistic string have its endpoints 
fixed at (0, 0), and ( a , 0). In the static configuration the string is stretched along the x axis 
between these two points. In a transverse oscillation, the x-coordinate of any point on the 
string does not change in time. The transverse displacement of a point is given by its y 
coordinate. The x direction is longitudinal, and the y direction is transverse. To describe 
the classical mechanics of a homogeneous string, we need two pieces of information: the 
tension To and the mass per unit length no- The total mass of the string is then M — /xo a. 

Let us look briefly at the units. Tension has units of force, so 

[Energy] 

[T 0 ] = [Force] = ^ . (4.1) 

If you stretch a string an infinitesimal amount dx, its tension remains approximately con- 
stant through the stretching, and the change in energy equals the work done Todx. The 
total mass of the string does not change. If we were considering relativistic strings, how- 
ever, a static string with more energy would have a larger rest mass. Using (4.1), noting 
that energy has units of mass times velocity squared, and that /xo has units of mass per unit 
length, we have 

[ T 0 ] = ^-[vf = [/x 0 ][v] 2 . (4.2) 

For a nonrelativistic string, both Tq and /xo are adjustable parameters, and the velocity on 
the right-hand side above will turn out to be the velocity of transverse waves. The above 
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A short piece of a classical nonrelativistic string vibrating transversely. With different slopes 
at the two endpoints there is a net vertical force. 



equation suggests that the string tension To and the linear mass density /iq in a relativistic 
string might be related by To = /xo c 2 , since c is the canonical velocity in relativity. We will 
see in Chapter 6 that this is indeed the correct relation for a relativistic string. 



Returning to our classical nonrelativistic string, let us figure out the equation of motion. 
Consider a small portion of the static string that extends from x to x + dx, with y = 0. This 
piece is shown in transverse oscillation in Figure 4. 1 . At time t, the transverse displacement 
of the string is y(t, x ) at x and y (t, x + dx) at x + dx. We will assume that the oscillations 
are small, and by this we will mean that at all times 



dy 

dx 



« 1 , 



( 4 . 3 ) 



at any point on the string. This guarantees that the transverse displacement of the string is 
small compared to the length of the string. The length of the string changes little, and we 
can assume that the tension Tq is unchanged. 

The slope of the string is a bit different at the points x and x + dx. This change of 
slope means that the string tension changes direction and the portion of string under con- 
sideration feels a net force. For transverse oscillations we need only calculate the net 
vertical force; the net horizontal force is negligible (Problem 4.1). The vertical force at 
(x + dx, y + dy) is given accurately by Tq times dy /'dx evaluated at x + dx and is point- 
ing up; similarly, the vertical force at (x, y) is Tq times dy/dx evaluated at x and is pointing 
down. Therefore the net vertical force d F v is 



dy 

dF v = 7o/ 
dx 



T 3y 
, ~ Io ^~ 
x+dx dx 



d 2 y 

~ T °3? dx - 



( 4 . 4 ) 



The mass dm of this piece of string, originally stretched from x to x + dx, is given by 
the mass density /xq times dx. By Newton’s law, the net vertical force equals mass times 
vertical acceleration. So we can simply write 



d 2 y 3 2 y 

T °-y? dI = (Mdx) w 



( 4 . 5 ) 



We cancel rfxon each side and rearrange terms to get 
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d 2 y _ mo 9^y 
dx 2 To dt 2 



(4.6) 



This is just a wave equation! Recall that for the wave equation 

9^_1 9 ^ = q 

dx 2 Vq dt 2 



(4.7) 



the parameter vq is the velocity of the waves. Thus for the transverse waves on our stretched 
string, the velocity r>o of the waves is 

vo = y/To/fio- (4.8) 

The higher the tension or the lighter the string, the faster the waves move. 



4.2 Boundary conditions and initial conditions 



Since equation (4.6) is a partial differential equation involving space and time derivatives, 
in order to fix solutions we must in general apply both boundary conditions and initial con- 
ditions. Boundary conditions (B.C.) constrain the solution at the boundary of the system, 
and initial conditions constrain the solution at a given starting time. The most common 
types of boundary conditions are Dirichlet and Neumann boundary conditions. 

For our string, Dirichlet boundary conditions specify the positions of the string end- 
points. For example, if we attach each end of the string to a wall (Figure 4.2, left), we are 
imposing the Dirichlet boundary conditions 

y(t, x = 0) = y(t , x = a) — 0, Dirichlet boundary conditions. (4.9) 

Alternatively, if we attach a massless loop to each end of the string and the loops are 
allowed to slide along two frictionless poles, we are imposing Neumann boundary con- 
ditions. For our string, Neumann boundary conditions specify the values of the derivative 
dy/dx at the endpoints. Since the loops are massless and the poles are frictionless, the 
derivative dy/dx must vanish at the poles x = 0, a (Figure 4.2, right). If this were not 
the case, then the slope of the string at a pole would be nonzero, and a component of the 
string tension would accelerate the rings in the v direction. Since each ring is massless, 





Left: string with Dirichlet boundary conditions at the endpoints. Right: string with 
Neumann boundary conditions at the endpoints. 
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their acceleration would be infinite. This is not possible, so, in effect, we are imposing the 
Neumann boundary conditions 

3y 3 y 

— (t,x — 0) = — — (f, x = a) = 0, Neumann boundary conditions. (4.10) 
dx dx 

These Neumann boundary conditions apply to strings whose endpoints are free to move 
along the y direction. 

Let us see how we can solve the wave equation for a particular set of initial conditions. The 
general solution of equation (4.6) is of the form 

y(t, x) — h+ (x - no t) + h-(x + no t), (4.11) 

where h+ and h- are arbitrary functions of a single variable. This solution represents a 
superposition of two waves, h+ moving to the right and /?_ moving to the left. Suppose 
that the initial values of y and dy/dt are known at time t = 0. Using equation (4.11) we 
see that this information yields the equations 

y(0, x) = h+(x) + h-(x), (4.12) 

^y(0, x) — - voh' + (x ) + voh'_(x), (4.13) 

where the left-hand sides are known functions, and the primes denote derivatives with 
respect to arguments. Using (4.12) we can solve for h- in terms of h+. Substituting into 
(4.13), we get a first-order ordinary differential equation for h + . Once we have solved for 
h + (using appropriate boundary conditions), we can use (4.12) again, this time to find the 
explicit form of h-. With h + and h- known, the full solution of the equations of motion is 
given by (4.11). 



4.3 Frequencies of transverse oscillation 



Suppose that we have a string where each point is oscillating in the y direction sinusoidally 
and in phase. This means that y(t, x) is of the form 

y(t, x) — y(x) sin(tt>t + <p), (4.14) 



where co is the angular frequency of oscillation and 4> is the constant common phase. Our 
aim is to find the allowed frequencies of oscillation. Substituting (4.14) into (4.6) and 
cancelling the common time dependence, we find 



d 2 y(x ) 
dx 2 



+ co 



2 

T 0 



y(x) = o. 



(4.15) 



This is an ordinary second-order differential equation for the profile y(x) of the oscilla- 
tions. The allowed frequencies are selected by this equation, together with the boundary 
conditions. Since co, /iq, and 7 q are constants, the differential equation is solved in terms of 
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trigonometric functions. With Dirichlet boundary conditions (4.9) we have the nontrivial 
solutions 

/mtx\ 

y n (x) = A n sin ^ J, n= 1,2,..., (4.16) 

where A n is an arbitrary constant. The value n — 0 is not included above because it repre- 
sents a motionless string. Plugging y n (x) into (4.15), we find the allowed frequencies co n : 



(On 




n= 1,2, .... 



(4.17) 



These are the frequencies of oscillation for a Dirichlet string. The strings on a violin are 
Dirichlet strings. To tune a violin to the correct frequency one must adjust the string ten- 
sion. The higher the tension is, the higher the pitch, as predicted by (4.17). For the case of 
Neumann boundary conditions (4.10), we obtain the spatial solutions 

/mtx\ 

y n (x) = A,, cos y J, 77 = 0,1,2, (4.18) 

This time the n = 0 solution is a little less trivial: the string does not oscillate, but it is 
rigidly translated to y(t,x ) = Aq. The oscillation frequencies, found by plugging (4.18) 
into (4.15), are the same as those in (4.17). Therefore, the oscillation frequencies are the 
same in the Neumann and Dirichlet problems. The Neumann case admits one extra solution 
not included in our oscillatory ansatz (4.14): the string can translate with constant veloc- 
ity. Indeed, y(t, x) — at + b, with a and b arbitrary constants, satisfies both the boundary 
conditions and the original wave equation (4.7). 



4.4 More general oscillating strings 



Let us discuss briefly some problems that are closely related to the ones considered thus 
far. For example, we can take the mass density of the string to be a function /r(x) of 
position. The form (4.6) of the wave equation does not change since it is derived from 
local considerations: the examination of a little piece of string that can be chosen to be 
sufficiently small so that the mass density is approximately constant. We therefore get 



d 2 y /i(*) d 2 y _ Q 

dx 2 T 0 dt 2 

For normal oscillations, we use the ansatz in (4.14) and find 



(4.19) 




To 



co 2 y(x) = 0. 



(4.20) 



This equation is no longer simple to solve, and it can only be studied in detail once the 
function /r(x) is specified. In Problems 4.3 and 4.7 you will consider some specific mass 
distributions, and you will explore a variational approach that gives an upper bound for the 
lowest oscillation frequency. 



78 



Nonrelativistic strings 



So far we have only considered strings that oscillate transversally. Strings also admit 
longitudinal oscillations, although the relativistic string does not. Imagine a string stretched 
along the x axis, and consider the infinitesimal segment which at equilibrium extends from 
x to x + dx. Suppose now that at time t the ends of this infinitesimal segment are longi- 
tudinally displaced from their equilibrium positions by distances rj(t, x) and rj(t, x + dx), 
respectively. If these two quantities are not the same, the piece of string is being com- 
pressed or stretched. An equation of motion can be obtained for this system, much as we 
did for transverse motion. It is not possible, however, to assume that the tension is constant 
throughout the string. For transverse oscillations the net force acting on a little piece of 
string arose from the different angles at which the same tension was applied on opposite 
ends of the piece. If the string always lies along the x axis then a net force can act on a seg- 
ment only if the tension is different on its two ends. Therefore the waves on a longitudinally 
oscillating string are accompanied by tension waves (Problem 4.2). 



4.5 A brief review of Lagrangian mechanics 



The Lagrangian L of a system is defined by 

L — T — V , (4.21) 

where T is the kinetic energy of the system and V is the potential energy of the system. 
For a point particle of mass m moving along the x axis under the influence of a time- 
independent potential V (x), the nonrelativistic Lagrangian takes the form 

1 t dx(t) 

L(t) = —m (x(t)) 2 — V (x(t)) , i(f) = — f 1 . (4-22) 

2 dt 

We must emphasize that the above Lagrangian is implicitly a function of time, but it has 
no explicit time dependence. All the time dependence arises from the time dependence of 
the position x(t). The action S is defined as 

S= f L(t)dt, (4.23) 

Jv 

where Pisa path x(t) between an initial position x,- at an initial time f; , and a final position 
Xf at a final time t f > ti. One such path is shown in Figure 4.3. 

The action is afunctional. Whereas a function of a single variable takes one number - the 
argument - as input and gives another number as output, a functional takes a function as 
the input, and gives a number as output. Since a function is usually defined by its values 
at infinitely many points, we can think of a functional as a function of infinitely many 
variables. In our present application, the input for the action functional is the function x(t) 
which determines the path V. We can emphasize the argument of S by using the notation 
,S[x]. Here [x] represents the full function xit). It is potentially confusing to write S[x(r)], 
since it suggests that S is ultimately a function of t , which it is not. 
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A path V representing a possible one-dimensional motion x(f) of a particle during the time 
interval [f,-, tf]. 




A path x(t) and its variation x(f) + Sx(t). This variation <$x(f) vanishes at f = f, and at t = t f . 

More explicitly, for any path x(t), the action is given by 

S[x] = J'' f { \m (i(O) 2 ~ V(x(t))^dt. (4.24) 

It is very important to emphasize that the action S can be calculated for any path x(t) 
and not only for paths that represent physically realized motion. It is because S can be 
calculated for all paths that it is a very powerful tool to find the paths that can be physically 
realized. 

Hamilton’s principle states that the path V which a system actually takes is one for which 
the action S is stationary. More precisely, if this path V is varied infinitesimally, the action 
does not change to first order in the variation. In terms of the function x(t) which specifies 
the path, the perturbed path takes the form x{t) + Sx(t), as shown in Figure 4.4. For any 
time r, the variation Sx(t) is the vertical distance between the original path and the varied 
path. As in the figure, we consider variations where the initial and final positions x; = x(t{) 
and xf = x(tf) are unchanged: 



Sx(ti) = Sx(tf) = 0. 



(4.25) 
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We now calculate the action .S'[x + <5x] for the perturbed path x(t) + 8x{t ): 

nt f \ m ( d \ 2 

<dt 



5[x + <5x] = 

= S[xi 



= f' f {|(^U(0 + 5*(0)) 2 - v (x(t) + 8x(t)) J dt 



C I 



mx(f) — 5x(t) - V , (x(r))5x(r) } * + e>((<$x) 2 ). (4.26) 

r/r 



In passing to the last right-hand side we expanded V in a Taylor series about x(t). The 
terms of order {8x) 2 and higher are unnecessary to determine whether or not the action is 
stationary. We have thus left them undetermined and indicated them by 0((<5x) 2 ). We can 
write the new action as S + SS, where SS is linear in 8x. From the equation above we see 
that 8S is given by 



8S = J jmx 



(t)—8x(t) — V'(x(t)) Sx(t ) } dt. 



(4.27) 



To find the equations of motion, the variation 8S must be rewritten in the form 8S = 
f dt Sx(t ){. . .}. In particular, no derivatives must be acting on 8x. This can be achieved 
using integration by parts: 

SS = j' | — (jnx(t)8x(t)) — mx(t)Sx(t ) — V'(x(t )) iSx(t) j dt 

rtf 

;(tf)8x(tf ) — mx(tj)8x(tj ) + / Sx(t)(— mx(t) — V\x(t )) ) dt. (4.28) 

Jti 



mx( 



Making use of (4.25), the variation reduces to 

"tf 



f f 

8S= 8x{t)(— mx{t) — y , (x(f)) ) dt. 

Jt: 



(4.29) 



The action is stationary if 8S vanishes for every variation 8x(t). For this to happen, the 
factor multiplying 8x(t) in the integrand must vanish: 



mx(t) = — V\x{t)). 



(4.30) 



This is Newton’s second law applied to the motion of a particle in a potential V(x). We 
have recovered the expected equation of motion by requiring that the action be stationary 
under variations. 



Suppose that we have determined the path that the particle takes while going from x, to Xf. 
As we have seen, the action is then stationary under variations that vanish at the initial and 
final times. Is the action also stationary under variations that change the initial position at t, 
or the final position at tf ? In general, the answer is no. This can be seen from equation (4.5). 
The integral term vanishes by assumption, but if 8x(tf ) ^ 0, the first term on the right- 
hand side would not vanish unless mx(tf), the final momentum of the particle, happens to 
vanish. The situation is analogous for Sx (t, ) ^ 0. 

Hamilton’s principle states that the action is stationary about the classical solution. The 
classical solution does not always define a minimum of the action. It is possible to construct 
a simple example in which the classical solution is a saddle point of the action functional: 
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the action increases for some variations and decreases for others. See Problem 4.6 for the 
details. 



4.6 The nonrelativistic string Lagrangian 



Let us return now to our string with constant mass density /iq, constant tension 7o, and ends 
located at x — 0 and x — a. The kinetic energy is simply the sum of the kinetic energies of 
all the infinitesimal segments that comprise the string. So it can be written as 

r =r^>(s) 2 ' (4 - 3i) 

The potential energy arises from the work which must be done to stretch the segments. 
Consider an infinitesimal portion of string which extends from ( x , 0) to (x + clx , 0) when 
the string is in equilibrium. If the string element is momentarily stretched from (x, y) 
to (x + dx, y + dy), as in Figure 4.1, then the change in length A l of the infinitesimal 
segment is given by 

A l = yj (dx) 2 + (dy) 2 -dx = dx(J 1 + - l) ~ ^ » ( 4 - 32 ) 



where we have used the small oscillation approximation (4.3) to discard higher-order terms 
in the expansion of the square root. Since the work done in stretching each infinitesimal 
segment is TqAI, the total potential energy V is 



V = 




(4.33) 



The Lagrangian for the string is given by T — V : 



L(t) = 





where C is referred to as the Lagrangian density : 



(4.34) 




(4.35) 



The action for our string is therefore 




In this action the “path” is the function y(f, x), defined over the region of it, x) space 
shown shaded in Figure 4.5. 
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x A 



a 



boundary 

condition 



initial 

condition 



final 

condition 



tj boundary tf 
condition 



> 

t 



The string motion is defined by y(t, x) over the domain t e [f,-, t f ], x e [0, a\. Boundary 
conditions apply at x = 0 and x = a for all t e [tj, tf]. Initial and final conditions apply for 
t = tj and t = t f , respectively, for all x e [0, a]. 



To find the equations of motion, we must examine the variation of the action as we vary: 
y(t,x) — »• y(t, x) + Sy(t, x). Performing the variation as before, we get 



SS = 



- (", r 

Jtj Jo 



dx 



Mo 



dy 905)0 dy d(Sy) 



dt dt 



To 



dx dx 



(4.37) 



Quick calculation 4.1 Prove equation (4.37). 



We must have no derivatives acting on the variations, so we rewrite each of the two terms 
above as a full derivative minus a term in which the derivative does not act on the variation: 



SS = 




dx 



9 

9r 



( m ! s - v ) 



3 To 



9 

dx 





+ To 



d 2 y 

dx 2 



8y . 
(4.38) 



The time derivative reduces to evaluations at tf and tj, while the space derivative gives 
evaluations at the string endpoints: 



SS = 





To 



dy 

dx 



Sy 



Jx=0 



dt 





Sy. 



(4.39) 



Our final expression for SS contains three terms. Each one must vanish independently. 
The third term, for example, is determined by the motion of the string for x e (0, a ) and 
t e ( tj , tr). In this domain Sy(t, x) is not restricted by boundary conditions nor by initial 
or final conditions, so we set to zero the coefficient of Sy and recover our original equation 
(4.6). The first term in (4.39) is determined by the configuration of the string at times tj 
and tf. If we specify these initial and final configurations, we are in effect setting Sy(t , , x) 
and Sy(tf, x) to zero. This causes the first term to vanish. We encountered an analogous 
situation in our study of the free particle. 
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The second term in (4.39) is new. Written out explicitly, it is 




T 0 ^(t,a)Sy(t,a) + T 0 ^-(t, 0) Sy(t,0) 
dx d x 



dt, 



(4.40) 



and it concerns the motion of the string endpoints y(t. 0) and y(t. a). We need a boundary 
condition for each of the two terms above. Let x* denote the x coordinate of an endpoint; x* 
can be equal to zero or equal to a. Selecting an endpoint means fixing the value of x*. We 
can make each term in (4.40) vanish by specifying either Dirichlet or Neumann boundary 
conditions. Consider an endpoint x* and the associated term in (4.40). If we impose a 
Dirichlet boundary condition the position of the chosen endpoint is fixed throughout time, 
and we require that the variation Sy(t, x*) vanishes. This will cause the chosen term to 
vanish. If, on the other hand, we assume that the endpoint is free to move, then the variation 
Sy(t, x*) is unconstrained. The term will vanish if we impose the condition 



dy 

— (t,x*) = 0, 

OX 


Neumann boundary condition. 


(4.41) 


Dirichlet boundary conditions can be written in a form where the similarity to Neumann 
boundary conditions is more apparent. If a string endpoint is fixed, the time derivative of 


the endpoint coordinate must vanish 






dy 

/(fx») = 0, 

ot 


Dirichlet boundary condition. 


(4.42) 



The similarity with (4.41) is quite striking. The only change is that spatial derivatives were 
turned into time derivatives. If we write Dirichlet boundary conditions in this form, we 
must still specify the values of the coordinates at the fixed endpoints. In order to appreciate 
further the physical import of boundary conditions, we consider the momentum p y carried 
by the string. There is no other component to the momentum, because we have assumed 
that the motion is restricted to the y direction. This momentum is simply the sum of the 
momenta of each infinitesimal segment along the string: 



Py 

Let us see if this momentum is conserved: 

dpy(t) f a d 2 y 



dt 



= / 



dx = 



-r 

Jo 

k'ed: 

-r 

Jo 



dy A 

p.o—dx. 

ot 



3 2 y 

Toy— 2 dx = T ° 



'dy' 

dx 



x=0 



(4.43) 



(4.44) 



where we used the wave equation (4.6). We see that momentum is conserved when 
Neumann boundary conditions (4.41) apply at both endpoints. For Dirichlet boundary 
conditions momentum is not generally conserved! Indeed, when the endpoints of a string 
are attached to a wall, the wall is constantly exerting a force on the string. In the lowest 
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normal mode of a Dirichlet string, for example, the net momentum constantly oscillates 
between the +y and —y directions. 

Why is this important for string theory? For a long time string theorists did not take seri- 
ously the possibility of Dirichlet boundary conditions. It seemed unphysical that the string 
momentum could fail to be conserved. Moreover, what could the endpoints of open strings 
be attached to? The answer is that they are attached to D-branes - a new kind of dynamical 
extended object. If a string is attached to a D-brane then momentum can be conserved - the 
momentum lost by the string is absorbed by the D-brane. A detailed analysis of the spatial 
boundary term induced by variation is crucial to recognize the possibility of D-branes in 
string theory. 



We conclude this chapter with a more general derivation of the equation of motion for the 
string. For this, we use (4.35) to write the action as 



S = 




We also define the quantities 





(4.45) 



(4.46) 



with y' = dy/dx. These are simply the derivatives of C with respect to its first and second 
arguments, respectively. Explicitly, they are 

V l =^, V x = — 7o^. (4.47) 

at ax 

When we vary the motion by Sy, the variation of the action is given by 



rtf r a r dC 3 C i C a r i 

8S = J dt J dx[—8y+ — <5/J = J dt J dx yp‘ Sy + V x Sy' J . (4.48) 



Using the standard manipulations we find 



r tf , r , fdV dr x \ „ 

dt L dx (-^ + ^) Sy - 



(4.49) 



Quick calculation 4.2 Derive equation (4.49). 



Quick calculation 4.3 Match in detail equations (4.49) and (4.39). 
The variation in (4.49) gives the equation of motion 

dV dV x „ 

-f- — 0. 

3 1 dx 

Using (4.47) we readily see that this is the wave equation (4.6). 



(4.50) 
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Note that V‘ , as given in (4.47), coincides with the momentum density in equation 
(4.43). This is not an accident. In Lagrangian mechanics, the derivative of the Lagrangian 
with respect to a velocity is the conjugate momentum. For the string, y plays the role of a 
velocity, so V ' , the derivative of the Lagrangian density with respect to y, is a momentum 
density. 

In addition, note that for string endpoints that are free to move, the vanishing of SS 
requires V x = 0. As we can see from (4.47), this is a Neumann boundary condition. Fur- 
thermore, V ' vanishes at the string endpoints for a Dirichlet boundary condition (4.42). A 
more detailed analysis of these facts will be given in Chapter 8, where V' and V x will be 
shown to have an interesting two-dimensional interpretation. 



Problems 



Problem 4.1 Consistency of small transverse oscillations. 

Reconsider the analysis of transverse oscillations in Section 4. 1 . Calculate the horizontal 
force d F/, on the little piece of string shown in Figure 4.1. Show that for small oscilla- 
tions this force is much smaller than the vertical force cl F v responsible for the transverse 
oscillations. 

Problem 4.2 Longitudinal waves on strings. 

Consider a string with uniform mass density /iq stretched between x = 0 and x — a. Let 
the equilibrium tension be 7o. Longitudinal waves are possible if the tension of the string 
varies as it stretches or compresses. For a piece of this string with equilibrium length L, a 
small change AL of its length is accompanied by a small change AT of the tension where 

1 _ 1 AL 
to L AT 

Here tq is a tension coefficient with units of tension. Find the equation governing the small 
longitudinal oscillations of this string. Give the velocity of the waves. 

Problem 4.3 A configuration with two joined strings. 

A string with tension Tq is stretched from x = 0 to x = 2 a. The part of the string 
x e (0, a) has constant mass density /i i , and the part of the string x e (a, 2a) has con- 
stant mass density /xt. Consider the differential equation (4.20) that determines the normal 
oscillations. 



(a) What boundary conditions should be imposed on y (x ) and ^ (x ) at x = al 

(b) Write the conditions that determine the possible frequencies of oscillation. 

(c) Calculate the lowest frequency of oscillation of this string when fi\ = /xq and 
M2 = 2fl 0 . 
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Problem 4.4 Evolving an initial open string configuration. 

A string with tension To, mass density /xo, and wave velocity vq = y/To/no, is stretched 
from (x, y) — (0, 0) to (x, y) = (a, 0). The string endpoints are fixed, and the string can 
vibrate in the y direction. 

(a) Write y(t,x) as in (4.11), and prove that the above Dirichlet boundary conditions 
imply 

h+{u) = —h-(—u) and h + (u) — h + (u + 2a). (1) 

Here u e (— oo, oo) is a dummy variable that stands for the argument of the func- 
tions h±. 

Now consider an initial value problem for this string. At t — 0 the transverse displacement 
is identically zero, and the velocity is 

3y x / x\ 

— (0, x) = d 0 -(1 ), x e (0, a). (2) 

at a \ a / 

(b) Calculate h + {u) for u e (—a, a). Does this define h+(u ) for all ul 

(c) Calculate y (f , x) for x and vq t in the domain D defined by the two conditions 

D = {(x, uof)|0 < x ± Do t < a}. 

Exhibit the domain D in a plane with axes x and vq t. 

(d) At t = 0 the midpoint x = a/2 has the largest velocity of all points in the string. Show 
that the velocity of the midpoint reaches the value of zero at time to = a/(2vo ) and 
that y(to, a/2) = a/ 12. This is the maximum vertical displacement of the string. 

Problem 4.5 Closed string motion. 

We can describe a nonrelativistic closed string fairly accurately by having the string 
wrapped around a cylinder of large circumference 2 tcR on which it is kept taut by the 
string tension To- We assume that the string can move on the surface of the cylinder with- 
out experiencing any friction. Let x be a coordinate along the circumference of the cylinder: 
x ~ x + 2tzR and let y be a coordinate perpendicular to x, thus running parallel to the axis 
of the cylinder. As expected, the general solution for transverse motion is given by 

y(x, t) — h+(x - do 0 + h~(x + Do 0. 

where h + (u) and /i_(d) are arbitrary functions of single variables u and v with 
— oo < u, v < oo. The string has mass per unit length /xo, and Do = V^o/Mo- 

(a) State the periodicity condition that must be satisfied by y (x , t) on account of the iden- 
tification that applies to the x coordinate. Show that the derivatives h' + (u) and h!_( v) 
are, respectively, periodic functions of u and v. 

(b) Show that one can write 

h+(u) — au + f(u ) , h-(v ) = j8i) + g(v ) , 
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where / and g are periodic functions and a and fi are constants. Give the relation 
between a and ft that follows from (a). 

(c) Calculate the total momentum carried by the string in the y direction. Is it conserved? 
Problem 4.6 Stationary action: minima and saddles. 

A particle perfoming harmonic motion along the x axis can be used to show that classical 
solutions are not always minima of the action functional. The action for this particle is 

S[x] = J Ldt = J dt -m 2 (x 2 — cb 2 x 2 ) , 

where m is the mass of the particle, w is the frequency of oscillation, and the motion hap- 
pens for t e [0, if]. Consider a classical solution x(t) and a variation Sx(t) that vanishes 
at t = 0 and t = tf. 

(a) Show that the variation of the action is exactly given by 

AS[«$x] = S[x + 8x] - S[x] = \m 2 ' Jt((^) 2 - br<5x 2 ) . 

It is noteworthy that AS only depends on <5x; x drops out from the answer. 

(b) A complete set of variations that vanish at t — 0 and t = tf takes the form 

nn 

8 n x = sinft)„t , with co„ = — and n — 1, 2, . . ., oo . 

tf 

The general variation Sx that vanishes at t = 0 and t = tf is a linear superposition of 
variations 8 n x with arbitrary coefficients b„. Calculate A.S'[<5„x] (your answer should 
vanish for o>„ = d>). Prove that 

OO CO 

71=1 71=1 

(c) Show that for tf < T one gets AS[5„x] > 0 for all n > 1. Explain why this guarantees 
that the classical solution is a minimum of the action. Show that for Z < tf < =£- all 
variations 8 n x lead to AS > 0, except for <5jx, which leads to AS < 0. In this case 
the classical solution is a saddle point: there are variations that increase the action and 
variations that decrease the action. As tf increases, the number of variations 8 n x that 
decrease the action increases. 

Problem 4.7 Variational problem for strings. 

Consider a string stretched from x = 0 to x = a, with a tension 7o and a position- 
dependent mass density /i(x). The string is fixed at the endpoints and can vibrate in the y 
direction. Equation (4.20) determines the oscillation frequencies co, and associated profiles 
i j/j (x) for this string. 

(a) Set up a variational procedure that gives an upper bound on the lowest frequency of 
oscillation o>o- (This can be done roughly as in quantum mechanics, where the ground 
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state energy Eq of a system with Hamiltonian H satisfies Eq < (i Jr, H\Jr)/(\[r, \J/).) As 
a useful first step consider the inner product 

ifi,fj) = J x)fi(x)\lrj(x)dx 

and show that it vanishes when <w, ^ Wj. Explain why your variational procedure 
works. 

(b) Consider the case /r (x ) = /tq-. Use your variational principle to find a simple bound 
on the lowest oscillation frequency. Compare with the answer co^ — ( 1 8.956) -^5 
obtained by a direct numerical solution of the eigenvalue problem. 

Problem 4.8 Deriving Euler-Lagrange equations.^ 

(a) Consider an action for a dynamical variable q(t): 

S = J dt L{q{t), q(t); t ). ( 1 ) 

Calculate the variation SS of the action under a variation Sq(t) of the coordinate. Use 
the condition SS = 0 to find the equation of motion for the coordinate q(t) (the Euler- 
Lagrange equation). 

(b) Consider an action for a dynamical field variable <p(t,x). As indicated, the field is 
a function of space and time, and is briefly written as the spacetime function <p(x). 
The action is obtained by integrating the Lagrangian density C over spacetime. The 
Lagrangian density is a function of the field and the spacetime derivatives of the held: 

S = Jd D x £(0W,8^W). (2) 

Here d D x = dt dx 1 . . ,dx d , and 9^0 = dtp/dx 11 . Calculate the variation SS of the 
action under a variation S<p{x) of the held. Use the condition SS — 0 to hnd the 
equation of motion for the held <p(x) (the Euler-Lagrange equation). 



r 

5 The relativistic point particle 

L 



To formulate the dynamics of a system we can write either the equations of 
motion or, alternatively, an action. In the case of the relativistic point particle it 
is rather easy to write the equations of motion. But the action is so physical and 
geometrical that it is worth pursuing in its own right. More importantly, while it 
is difficult to guess the equations of motion for the relativistic string, the action 
is a natural generalization of the relativistic particle action that we will study in 
this chapter. We conclude with a discussion of the charged relativistic particle. 



5.1 Action for a relativistic point particle 



In this section we learn how to formulate the relativistic theory that describes a free point 
particle of rest mass m > 0. A free particle is a particle that is not subject to any force. Our 
analysis begins with some preliminary remarks about units and nonrelativistic particles. 

For any dynamical system, the action S is obtained by integrating the Lagrangian over 
time. Since the Lagrangian has units of energy, the action has units of energy times time: 

L 2 ML 2 

[S] = M^T = —. (5.1) 

It is worth noting that the action has the same units as h. Indeed, a form of the quantum 
mechanical uncertainty principle states that the product of energy and time uncertainties is 
of order h . 



The action S m for a free nonrelativistic particle is given by the time integral of the kinetic 
energy: 



mv z (t)dt, v 2 = v-v, v = 



dx 

2 ’ dt ’ 

The equation of motion which follows by Hamilton’s principle is 



S nr — J L nr dt — J 2 



v = u . 



(5.2) 



-= 0 . 

dt 



(5.3) 



The free particle moves with constant velocity. Since even a free relativistic particle must 
move with constant velocity, how do we know that the action S nr is not correct in relativity? 
Perhaps the simplest answer is that this action allows the particle to move with any constant 
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velocity, even one that exceeds the velocity of light. The velocity of light does not even 
appear in this action. S m cannot be the action for a relativistic point particle. 

We now construct a relativistic action S for the free point particle. We will do this by 
making an educated guess and then showing that it works properly. Since we are interested 
in relativistic physics it is convenient to represent the motion of the particle in spacetime. 
The path traced out by the particle in spacetime is called the world-line of the particle. 
Even a static particle traces a line in spacetime since time always flows. 

A physically consistent action must yield Lorentz invariant equations of motion. Let us 
elaborate on this point. Suppose that a particular Lorentz observer tells you that a particle 
appears to be moving in accordance to its equations of motion, plainly, that the particle 
is performing physical motion. Then, you should expect that any other Lorentz observer 
will tell you that the particle is doing physical motion. It would be inconsistent for one 
observer to state that a certain motion is allowed and for another observer to state that the 
same motion is forbidden. If the equations of motion hold in a fixed Lorentz frame, they 
must hold in all Lorentz frames. This is what Lorentz invariance of the equations of motion 
means. 

We are going to write an action, and we are going to take our time to find the equations of 
motion. Is there any way to impose a constraint on the action that will result in the Lorentz 
invariance of the equations of motion? Yes, there is. We require the action to be a Lorentz 
scalar: for any particle world-line, all Lorentz observers must compute the same value for 
the action. Since the action has no spacetime indices, this is a reasonable requirement. If the 
action is a Lorentz scalar, the equations of motion will be Lorentz invariant. The reason is 
simple and neat. Suppose one Lorentz observer states that, for a given world-line, the action 
is stationary against all variations of the world-line. Since all Lorentz observers agree on 
the value of the action for any world-line, they will all agree that the action is stationary 
about the world-line in question. By Hamilton’s principle, the world-line that makes the 
action stationary satisfies the equations of motion, and therefore all Lorentz observers will 
agree that the equations of motion are satisfied for the world-line in question. 

Lorentz invariance imposes strong constraints on the possible forms of the action. In 
fact, there are valid grounds to worry that Lorentz invariance is too strong a constraint 
on the action. The nonrelativistic action in (5.2), for example, is not invariant under a 
Galilean boost v — »• v + vq with constant Co- Such a boost is a symmetry of the theory, 
since the equation of motion (5.3) is invariant. Similarly, it could happen that the equations 
of motion for the relativistic point particle are Lorentz invariant but that the action is not. 
Fortunately, this complication does not occur in this case; we will find a satisfactory fully 
Lorentz invariant action. 

Quick calculation 5.1 Calculate explicitly the variation of the action S ni under a boost. 

We know that the action is a functional - it takes as input a set of functions that describe 
a world-line and it outputs a number S. Imagine a particle whose spacetime trajectory 
starts at the origin and ends at ( ctf,Xf ). There are many possible world-lines between 
the starting and ending points, as shown in Figure 5.1 (which uses one spatial dimension 
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A spacetime diagram with a series of world-lines connecting the origin to the spacetime 
point (ct f , x f ). 

for ease of representation). We would like that, for any world-line, all Lorentz observers 
compute the same value for the action. Let V denote one world-line. What quantity related 
to V do all Lorentz observers agree on? The elapsed proper time! All Lorentz observers 
agree on the amount of time that elapses on a clock carried by the moving particle. So let us 
take the action of the world-line V to be proportional to the proper time associated with it. 

To formulate this idea quantitatively, we recall that 

- ds 2 = - c 2 dt 2 + ( dx l ) 2 + (dx 2 ) 2 + (dx 3 ) 2 , (5.4) 

and that the infinitesimal proper time is equal to ds/c (recall that ds 2 = (ds) 2 since inter- 
vals are timelike). The integral of (ds/c) over the path V gives the proper time elapsed on 
V. Since proper time has units of time, to get the units of action we need an additional mul- 
tiplicative factor with units of energy or units of mass times velocity-squared. This factor 
should be Lorentz invariant, to preserve the Lorentz invariance of our partial guess (ds/c). 
For the mass we can use m, the rest mass of the particle, and for the velocity we can use c, 
the fundamental velocity in relativity. We cannot use the particle velocity because it is not 
a Lorentz invariant. The factor is then me 2 , which is, in fact, the rest energy of the particle. 
Therefore, our guess for the action is the integral of me 2 (ds/c) = me ds. Of course, there 
is still the possibility that a dimensionless numerical factor is missing. It turns out that 
there should be a minus sign, but the unit coefficient is correct. We therefore claim that the 
correct action is 



5 = 




(5.5) 



The action is equal to minus the rest energy times the proper time. This action is so simple 
looking that it may be baffling. It probably looks nothing like the actions you have seen 
before. We can make its content more familiar by choosing a particular Lorentz observer 
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and expressing the action as the integral of a Lagrangian over time. With the help of (5.4) 
we relate ds to dt by 



ds = c dt 




(5.6) 



This allows us to write the action in (5.5) as an integral over time: 




where f; and tf are the values of time at the initial and final points of the world-line V, 
respectively. From this version of the action, we see that the relativistic Lagrangian for the 
point particle is 




The Lagrangian is equal to minus the rest energy times a relativistic factor. This Lagrangian 
makes no sense when v > c since it ceases to be real. The constraint of maximal velocity 
is therefore implemented. This could have been anticipated: proper time is only defined 
for motion where the velocity does not exceed the velocity of light. The paths shown in 
Figure 5.1 all represent motion where the velocity of the particle never exceeds the velocity 
of light. Only for such paths is the action defined. At any point in any of those paths, the 
tangent vector to the path is a timelike vector. 

To show that this Lagrangian gives the familiar physics in the limit of small velocities, 
we expand the square root assuming v c. Keeping just the first term in the expansion 
gives 



'K?)- 



2 1 2 
-me H — mv . 
2 



(5.9) 



Constant terms in a Lagrangian do not affect the equations of motion, so the term (—me 2 ) 
can be ignored for this purpose. The leading significant term coincides with the nonrela- 
tivistic Lagrangian in (5.2), showing that the familiar nonrelativistic physics emerges. This 
also confirms that we normalized the relativistic Lagrangian correctly. 

The canonical momentum is the derivative of the Lagrangian with respect to the velocity. 
Using (5.8) we find 



3 L 2 / 5 \ 1 mv 

dv V C 2 / y 7 ! — u 2 /c 2 y 7 ! — v 2 /c 2 



(5.10) 



This is just the relativistic momentum of the point particle. What about the Hamiltonian? 
It is given by 
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H = p ■ v — L = + me' 

1 n, o i o 



y/l ~ V 2 /C 2 



.2 




where the result was left as a function of the velocity of the particle, rather than as a 
function of its momentum. As expected, the answer coincides with the relativistic energy 
(2.68) of the point particle. 

We have therefore recovered the familiar physics of a relativistic particle from the rather 
remarkable action (5.5). This action is very elegant: it is briefly written in terms of the 
geometrical quantity ds, it has a clear physical interpretation as total proper time, and it 
manifestly guarantees the Lorentz invariance of the physics it describes. 



In this section we explore an important property of the point particle action (5.5). This 
property is called reparameterization invariance. To evaluate the integral in the action, an 
observer may find it useful to parameterize the particle world-line. Reparameterization 
invariance of the action means that the value of the action is independent of the parame- 
terization chosen to calculate it. This should be so, since the action (5.5) is in fact defined 
independently of any parameterization: the integration can be done by breaking V into 
small pieces and adding the values of me ds for each piece. No parameterization is needed 
to do this. In practice, however, world-lines are described as parameterized lines, and the 
parameterization is used to compute the action. 

We parameterize the world-line V of a point particle using a parameter r (Figure 5.2). This 
parameter must be strictly increasing as the world-line goes from the initial point x? to the 
final point x't , but is otherwise arbitrary. As r ranges in the interval [r , r f \ it describes the 
motion of the particle. To have a parameterization of the world-line means that we have 
expressions for the coordinates x 11 as functions of r: 



5.2 Reparameterization invariance 






(5.12) 




A world-line fully parameterized by r. All spacetime coordinates x ^ are functions of r. 
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We also require 

xj 1 = x /i (r l ), Xj=x IJ '(Tf). (5.13) 

Note that even the time coordinate x° is parameterized. Normally, we use time as a param- 
eter and describe position as a function of time. This is what we did in Section 5.1. But 
if we want to treat space and time coordinates on the same footing, we must parameterize 
both in terms of an additional parameter r . 

We now reexpress the integrand ds using the parameterized world-line. To this end, we 
use ds 2 — — r]^ v dx ll dx v to write 

t dx M dx v 9 

ds" — — (dr) 2 . (5.14) 

dr dr 

For any motion where the velocity does not exceed the velocity of light ds 2 = (ds) 2 , and 
therefore the action (5.5) takes the form 



S = 



—me 




dx ^ dx v 
dr dr 



dr. 



(5.15) 



This is the explicit form of the action when the path has been parameterized by r. 

We have already seen that the value of the action is the same for all Lorentz observers. 
We have now fixed an observer, who has calculated the action using some parameter r. 
Does the value of the action depend on the choice of parameter? It does not. The observer 
can reparameterize the world-line, and the value of the action will be the same. Thus S is 
reparameterization invariant. To see this, suppose we change the parameter from r to r'. 
Then, by the chain rule, 

dx M dx 11 dr' 

= . (5.16) 

dr dr' dr 

Substituting back into (5.15), we get 



S = 



■r 

j ti 



dx t 1 dx v dr' 

-n uv dr = 

^ dr' dr' dr 



■c 



dxf dx v . 



(5.17) 



which has the same form as (5.15). thus establishing the reparameterization invariance. 
Because the verification of this property is quite simple, we say that the action (5.15) is 
manifestly reparameterization invariant. 



5.3 Equations of motion 



We now move on to the equations of motion. For this we must calculate the variation 8S 
of the action (5.5) when the world-line of the particle is varied by a small amount 8x l ‘(r). 
Flere r is an arbitrary parameter along the path. The variation is simply given by 



8S 



= —me 



I 



8(ds). 



(5.18) 
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The variation of ds can be found from the simpler variation of ds 2 — ( ds ) 2 . Varying both 
sides of (5.14) we find 

2ds8(ds) = -2r ] ^8(^)^(dr) 2 . (5.19) 

The factor of two on the right-hand side arises because, by symmetry, the variations of 
and ^4- give the same result. Since the variation of a velocity is equal to the time derivative 
of the variation of the coordinate, 



4— 1 = ^ 1 . 



V dr / 



dr 



Using this result and simplifying (5.19) a bit, we get 



S(ds) = rj^iv 



c/(<5x M ) dx x 
dr ds 



dr = 



diSx^) dx ^ 
dr ds 



dr, 



(5.20) 



(5.21) 



where r/ llv was used to lower the index of dx v . We can now go ahead and vary the action 
using (5.18): 



SS = me 



l 



x f d(8x M ) dx u 

— dr. 

dr ds 



(5.22) 



Here we introduced explicit limits to the integration: r,- and r / denote the values of the 
parameter at the initial and final points of the world-line, respectively. We recognize that 



mc- 



dxj, 

ds 



= mu, i = 



Pm 



and, as a result, the variation of the action takes the form 

" r f d(8x **) 



SS = 



-f 



dz 



Pn dr. 



(5.23) 



(5.24) 



To get an equation of motion we need to have Sx^ multiplying an object under the integral - 
the equation of motion is then simply the vanishing of that object. Since there are still 
derivatives acting on Sx 11 , we rewrite the integrand as a total derivative plus additional 
terms where Sx jL appears multiplicatively: 



SS = 




d 

dr 




dr Sx^i r) 



dpn 

dr 



(5.25) 



The first integral gives Sx 11 p lt evaluated at the boundaries of the world-line. This term 
vanishes because we fix the coordinates on the boundaries. Since the second term must 
vanish for arbitrary ()x ll (r), we obtain the equation of motion 




(5.26) 



It is clear that dp 11 /dr also vanishes. The equation of motion states that the momentum 
Pij (or //' ) of the point particle is constant along its world-line. This is a parameterization- 
independent statement. It implies, of course, that the momentum is constant in time. If a 
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function is constant over a line, its derivative with respect to any parameter used to describe 
the line will vanish. Indeed, the parameter r in (5.26) is arbitrary. We obtained this equation 
by varying the relativistic action for the point particle using fully relativistic notation. 

Quick calculation 5.2 Show that equation (5.26) implies that 

^ = 0 (5.27) 

dr' 

holds for an arbitrary parameter r'(r). What should be true about dr' /dr for r' to be a 
good parameter when r is one? 



If we parameterize the world-line with the proper time s, equation (5.26) gives 




(5.28) 



Using (5.23) to write the momentum as a derivative of the position with respect to proper 
time, we find 



d 2 x ^ 




ds 2 _0 ' 


(5.29) 



This is an equivalent version of the equation of motion. The constancy of dx^ /ds means 
that on a path marked by equal intervals of proper time, the change in x /l between any 
successive pair of marks is the same. Equation (5.29) does not hold when s is replaced 
by an arbitrary parameter r. This is reasonable: an arbitrary parameter means arbitrar- 
ily spaced marks, so the change in x 1 ' between any successive pair of new marks need 
not be the same. It is actually possible to write a slightly more complicated version 
of (5.29) that uses an arbitrary parameter and is manifestly reparameterization invariant 
(Problem 5.2). 



Our goal in this section has been achieved: we have shown how to derive the physically 
expected equation of motion (5.29) (or (5.26)), starting from the Lorentz invariant action 
(5.5). As we explained earlier, the resulting equation of motion is guaranteed to be Lorentz 
invariant. Let us check this explicitly. 

Under a Lorentz transformation, the coordinates x 1 ' transform as indicated in equation 
(2.38): x' 1 ' — [J’v x v , where the constants lJ\, can be viewed as the entries of an invertible 
matrix L. Since ds is the same in all Lorentz frames, the equation of motion in primed 
coordinates is (5.29), with x 1 ' replaced by x" 1 : 



0 = 



d 2 x^ 
ds 2 



d 2 _ 
ds 2 



{L»x v ) = L» 



d 2 x v 

ds 2 



(5.30) 



Since the matrix L is invertible, the above equation implies equation (5.29). Namely, if the 
equation of motion holds in the primed coordinates, it holds in the unprimed coordinates 
as well. This is the Lorentz invariance of the equations of motion. 
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( 



5.4 Relativistic particle with electric charge 



The point particle we have considered so far is free and it moves with constant four- velocity 
or four-momentum. If a point particle is electrically charged and there are nontrivial elec- 
tromagnetic fields, the particle will experience forces and its four-momentum will not be 
constant. You know, in fact, how the momentum of such a particle varies in time. Its time 
derivative is governed by the Lorentz force equation (3.5), which was written in relativistic 
notation in Problem 3.1: 



dp,i _ <? dx v 
ds c M v ds 



(5.31) 



This is a relatively intricate equation which involves the field strength and the four- velocity 
of the particle. Since ds appears on both sides of the equation, the equation in fact holds 
for a general parameter r : 



dpg _ <7 dx^_ 

dx c ax 



(5.32) 



In the spirit of our previous analysis we try to write an action that gives this equation upon 
variation. The action turns out to be remarkably simple. 



Since the Maxwell field couples to the point particle along its world-line V, we should 
add to the action (5.5) an integral over V representing the interaction of the particle with 
the electromagnetic field. The integral must be Lorentz invariant, and the form of (5.32) 
suggests that it involves the four- velocity of the particle. Since the four- velocity has one 
spacetime index, to obtain a Lorentz scalar we must multiply it against another object with 
one index. The natural candidate is the gauge potential 4 /( . We claim that the interaction 
term in the action is 

a C dx ^ 

- / dx A m (x( r)) — (t). (5.33) 

c J-p dr 

Here q is the electric charge, and the integral is over the world-line V , parameterized 
with the arbitrary parameter r. At each r, the vector (dx^ /dx) is dot multiplied against 
the gauge potential 4,,. evaluated at the position x(x ) of the particle. The integrand can be 
written more briefly as A^dx 1 *, by cancelling the factors ofdx. In this form, the interaction 
term is manifestly independent of parameterization. The world-line of the particle is a one- 
dimensional space, and the natural field that can couple to a particle in a Lorentz invariant 
way is a field with one index. This will have an interesting generalization when we consider 
the motion of strings. Since strings are one-dimensional, they trace out two-dimensional 
world-sheets in spacetime. We will see that they couple naturally to fields with two Lorentz 
indices! 

The full action for the electrically charged point particle is obtained by adding the term 
in (5.33) to (5.5): 
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S = — me 



[ ds + — [ A ll (x)dx IJ '. 
JV c Jp 



(5.34) 



This Lorentz invariant action is simple and elegant. The equation of motion (5.32) arises by 
setting to zero the variation of S under a change Sx 1 * of the particle world-line. I do not want 
to take away from you the satisfaction of deriving this important result. 1 have therefore left 
to Problem 5.5 the task of varying the action (5.34) and deriving the equation of motion. 



Problem 5.1 Point particle equation of motion and reparameterizations. 

If the path of a point particle is parameterized by proper time, the equation of motion is 
(5.29). Consider now a new parameter r = f (s). Find the most general function / for 
which (5.29) implies 



Problem 5.2 Particle equation of motion with arbitrary parameterization. 

Vary the point particle action (5.15) to find a manifestly reparameterization invariant form 
of the free particle equation of motion. 

Problem 5.3 Current of a charged point particle. 

Consider a point particle with charge q whose motion in a D = d + 1 -dimensional space- 
time is described by functions x M (r) = {x°(r), x(r)}, where r is a parameter. The moving 
particle generates an electromagnetic current j ,L = (cp, j). 

(a) Use delta functions to write expressions for the current components j°(x, t ) and 



Here S D (x) = 8(x°)8(x') . . .S(x d ). 

Problem 5.4 Hamiltonian for a nonrelativistic charged particle. : 

The action for a nonrelativistic particle of mass m and charge q coupled to an electromag- 
netic field is obtained by replacing the first term in (5.34) by the nonrelativistic action for 
a free point particle: 



Problems 




j‘(x, t). 

(b) Show that your answers in (a) arise from the integral representation 





We have also chosen to use time to parameterize the second integral. 




5.4 Relativistic particle with electric charge 



(a) Rewrite the action S in terms of the potentials (<t>, A) and the ordinary velocity v. What 
is the Lagrangian? 

(b) Calculate the canonical momentum p conjugate to the position of the particle and show 
that it is given by p = m v + | A . 

(c) Show that the Hamiltonian for the charged particle is 




Problem 5.5 Equations of motion for a charged point particle. 

Consider the variation of the action (5.34) under a variation W'tx) of the particle trajec- 
tory. The variation of the first term in the action was obtained in Section 5.3. Vary the 
second term (written more explicitly in (5.33)) and show that the equation of motion is 
(5.32). Begin your calculation by explaining why 

5A /X (x(r)) = ^^-(x(r)) Sx v (r). 

Problem 5.6 Electromagnetic field dynamics with a charged particle. 1 

The action for the dynamics of both a charged point particle and the electromagnetic field 
is given by 

S' — — me [ ds + — [ An{x)dx p [ d D x Fu V F ,xv . 

JV c Jv 4c J 

Here d D x = dx°dx 1 . . . dx d . Note that the action S' is a hybrid; the last term is an integral 
over spacetime, and the first two terms are integrals over the particle world-line. While 
included for completeness, the first term will play no role here. Obtain the equation of 
motion for the electromagnetic field in the presence of the charged particle by calculating 
the variation of S' under a variation SA^ of the gauge potential. The answer should be 
equation (3.34), where the current is the one calculated in Problem 5.3. [Hint: to vary 
A IL (x ) in the world-line action it is useful to rewrite this term as a full spacetime integral 
with the help of delta functions.] 

Problem 5.7 Point particle action in curved space. 

In Section 3.6 we considered the invariant interval ds 2 = —g llv (x)dx ll dx v in a curved 
space with metric g /lv (x). The motion of a point particle of mass m on curved space is 
studied using the action 




Show that the equation of motion obtained by variation of the world-line is 

1 dg^ dx p dx v 

2 dx p ds ds 

This is called the geodesic equation. When the metric is constant we recover the equation 
of motion of a free point particle. 



d dx p 

ds ds 



c 



Relativistic strings 



We now begin our study of the classical relativistic string - a string that is, in 
many ways, much more elegant than the nonrelativistic one considered before. 
Inspired by the point particle case, we focus our attention on the surface traced 
out by the string in spacetime. We use the proper area of this surface as the 
action; this is the Nambu-Goto action. We study the reparameterization property 
of this action, identify the string tension, and find the equations of motion. For 
open strings, we focus on the motion of the endpoints and introduce the concept 
of D-branes. Finally, we see that the only physical motion is transverse to the 
string. 



6.1 Area functional for spatial surfaces 



The action for a relativistic string must be a functional of the string trajectory. Just as a 
particle traces out a line in spacetime, a string traces out a surface. The line traced out 
by the particle in spacetime is called the world-line. The two-dimensional surface traced 
out by a string in spacetime will be called the world-sheet. A closed string, for example, 
will trace out a tube, while an open string will trace out a strip. These two-dimensional 
world-sheets are shown in the spacetime diagram of Figure 6. 1 . The lines of constant x° in 
these surfaces are the strings. These are the objects an observer sees at the fixed time jc°. 
They are open curves for the surface describing the open string evolution (left), and they 
are closed curves for the surface describing the closed string evolution (right). 

In Chapter 5 we learned that the point particle action is proportional to the proper time 
elapsed on the point particle world-line. The proper time, multiplied by c, is the Lorentz 
invariant “proper length” of the world-line. For strings we will define the Lorentz invariant 
“proper area” of a world-sheet. The relativistic string action will be proportional to this 
proper area, and is called the Nambu-Goto action. 

Area functionals are useful in other applications: a soap film held between two rings, for 
example, automatically constructs the surface of minimal area which joins one ring to the 
other (Figure 6.2). The string world-sheet and the soap bubble between two rings are very 
different types of surfaces. At any given instant of time a Lorentz observer will see the full 
two-dimensional surface of the soap film, but he or she can only see one string from the 
two-dimensional world-sheet. Imagine that the soap film is static in some Lorentz frame. 
In this case, time is not relevant to the description of the film, and we think of the film as 
a spatial surface, namely, a surface that extends along two spatial dimensions. The surface 
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A spatial surface stretching between two rings. If the surface were a soap film, it would be 
a minimal area surface. 

exists in its entirety at any instant of time. We will first study these familiar surfaces, and 
then we will apply our experience to the case of surfaces in spacetime. 

A line in space can be parameterized using only one parameter. A surface in space is 
two-dimensional, so it requires two parameters £ 1 and f 2 . Given a parameterized surface, 
we can draw on that surface the lines of constant and the lines of constant § 2 . These lines 
cover the surface with a grid. We call target space the world where the two-dimensional 
surface lives. In the case of a soap bubble in three dimensions, the target space is the three- 
dimensional space x 1 , x 2 , and x 3 . The parameterized surface is described by the collection 
of functions 
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Left: the parameter space, with a little rectangle selected. Right: the target space surface 
with the image of the little rectangle, a parallelogram whose sides are the vectors dv- \ and 
dv2 (shown magnified at the end of the wiggly arrow). 

X (£\ § 2 ) = (x'G 1 , § 2 ) , x 2 ($\ £ 2 ) , x 3 (§\ £ 2 )) . (6.1) 

The parameter space is defined by the ranges of the parameters f 1 and § 2 . It may be a 
square, for example, if we use parameters § 1 , £ 2 e [0, rr]. The physical surface is the image 
of the parameter space under the map x(f £ 2 ); it is a surface in target space. Alternatively, 
we can view the parameters f 1 and £ 2 as coordinates on the physical surface, at least 
locally. The map inverse to x takes the surface to the parameter space. Locally this map 
is one-to-one and it assigns to each point on the surface two coordinates: the values of the 
parameters and § 2 . 

We want to calculate the area of a small element of the target space surface. Let us start 
by looking at an infinitesimal rectangle on the parameter space. Denote the sides of the 
rectangle by 1 and d% 2 . We want to find d ,4 , the area of the image of this little rectangle 
in the target space. As shown in Figure 6.3, this is the area of the actual piece of surface 
that corresponds to the infinitesimal rectangle on parameter space. 

Of course, there is no reason why that infinitesimal area element in target space should 
be a rectangle. In general, it is a parallelogram. Let us call the sides of this parallelogram 
dv i and dv 2 . They are the images under the map x of the vectors (7/| 1 , 0) and (0, d% 2 ), 
respectively. We can write them as 

_ dx 1 _ dx 7 

dv 1 = —d^, dv 2 =^dH 2 . (6.2) 

This makes sense: dx/d £*, for example, represents the rate of variation of the space 
coordinates with respect to £*. Multiplying this rate by the length d f 1 of the horizontal 
side of the tiny parameter-space rectangle, gives us the vector dv \ that represents this side 
in the target space. Now let us calculate the area dA. Using the formula for the area of a 
parallelogram, 
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dA = | |rfv 2 l I sin 6 ? | = \dv\ \\dv2\j 1 — cos 2 6 

= — \dx>\ \ 2 \dv2\ 2 cos 2 9, (6.3) 

where 6 is the angle between the vectors dx>\ and dvj- In terms of spatial dot products, 
we have 

dA = \J (dv\ ■ dvi)(dv2 ■ di>2) — (z/i>i ■ dx>2) 2 ■ (6.4) 



Finally, using (6.2), 



I( 


dx \ / dx 


dx \ 


/ dx 


dx \ 


\d^ 


’ 9^/V9f 2 ' 


' df) 


\dl l 


' 9^ 2 / 



(6.5) 



This is the general expression for the area element of a parameterized spatial surface. The 
full area functional A is given by 






dx dx 

W ’ W 



\ / dx dx \ 2 

/ VgF' d^) • 



( 6 . 6 ) 



The integral extends over the relevant ranges of the parameters and £ 2 . The solution of 
a minimal area problem for a spatial surface is the function x (£', % 2 ) that minimizes the 
functional A. 



6.2 Reparameterization invariance of the area 



As we have seen, the parameterization of a surface allows us to write the area element in 
an explicit form. The area of the surface, or even more, the area of any piece of the surface, 
should be independent of the parameterization chosen to calculate it. This is what we mean 
when we say that the area must be reparameterization invariant. 

Because we will soon equate the relativistic string action to some notion of proper area, 
it, too, will be reparameterization invariant. This means that we will be free to choose the 
most useful parameterization without changing the underlying physics. A good choice of 
parameterization will enable us to solve the equations of motion of the relativistic string in 
an elegant way. 

Reparameterization invariance is thus an important concept so it should be understood 
thoroughly. To this end we will try to make it manifest in our formulae. The aim of the 
following analysis is to show how this can be done. 

Let us begin by asking: is the area functional A in (6.6) reparameterization invariant? We 
would certainly hope it is. In fact, at first glance it appears to be manifestly reparameteri- 
zation invariant. After all, if one reparameterizes the surface with f 1 (£ 1 ) and f 2 (| 2 ), then 
all of the derivatives introduced by the chain rule cancel appropriately. 

Quick calculation 6.1 Verify the above statement. That is, show that (6.6), written fully 
with tilde parameters (f 1 , f 2 ), equals (6.6) when f 1 = |*(f : ) and f 2 = f 2 (§ 2 ). 
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The above reparameterization, however, is not completely general for it fails to mix the f 1 
and q 2 coordinates. Suppose, instead, that we make a reparameterization f 1 1 , £ 2 ) and 

f 2 (£ 1 , £ 2 ). This time we can verify, using a somewhat laborious computation, that (6.6) is 
invariant under such a reparameterization. But the invariance is no longer intuitively clear. 
To make the reparameterization invariance of (6.6) manifest we will have to rewrite the 
area functional in a different way. 

We begin by observing how the measure of integration transforms. The change-of- 
variable theorem from calculus tells us that 



= 



det 



(-) 

\d^jJ 



dl x d~f = | det M| d% l d% 2 . 



where M — [M;j] is the matrix defined by M, ; - = d%‘ /d% ' . Similarly, 



dl l dl 2 = 



det(^) 

V3 i-j) 



d^df = | detM| d% l d% 2 , 



(6.7) 



( 6 . 8 ) 



where M — [Mjj\ is the matrix defined by M,j = di; 1 /di; J . Combining equations (6.7) 
and (6.8), we see that 

| det M\ | det M\ — 1 . (6.9) 



Let us now consider a target space surface S described by the mapping functions 3c (£ 1 , £ 2 ). 
Given a vector dx tangent to the surface, let ds denote its length. Then we can write 

ds 2 = (ds) 2 = dx ■ dx. (6.10) 



For surfaces in space, as we are considering now, it is not customary to include a minus 
sign in front of ds 2 (compare with (2.21)). The vector dx can be expressed in terms of 
partial derivatives and the differentials d% 2 : 



_ dx , dx dx 

dx = w d * + W d * = W dr - 



(6.11) 



The repeated index i is summed over its possible values 1 and 2. Back in (6.10), 

= (—d?) . Aft) = 81.11 it i it i . 

\d$' ) \dt;J / d$‘ d$J 

This can be neatly summarized as 



(6.12) 



ds* = gi jmd?d$> , 



(6.13) 



where gjj (§ ) is defined as 



dx 

8ij(^) = ^7 



dx 

w 



(6.14) 



The quantity gjj (f ) is known as the induced metric on S. It is called a metric because (6. 13) 
takes, up to a sign, the form of equation (3.78), where we introduced the general concept of 
a metric. It is a metric on S because, with §' playing the role of coordinates on S , equation 
(6.13) determines distances on S. It is said to be induced because it uses the metric on the 
ambient space in which S lives to determine distances on S. Indeed, the dot product which 
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appears in (6.14) is to be performed in the space where S lives and therefore presupposes 
that a metric exists on that space. We only have two parameters f 1 and £ 2 , so the full matrix 
gij takes the form: 



( dx 


dx 


dx 


dx ) 


w 


w 


w 


w 


dx 


dx 


dx 


dx 


\W 


w 


W 


W-) 



Now we see something truly nice! The determinant of gij is precisely the quantity which 
appears under the square root in (6.6). Letting 



g = det (gij) , 



(6.16) 



we can write 



A = 



J d% l df-^g. 



(6.17) 



This is an elegant formula for the area in terms of the determinant of the induced metric. 
Instead of trying to understand the reparameterization invariance of (6.6), we now focus on 
the equivalent but simpler expression (6.17). 

We are now in position to understand the invariance of the area in terms of the trans- 
formation properties of the metric gij. The key to this lies in equation (6.13). The 
length-squared ds 2 is a geometrical property of the vector dx that must not depend upon 
the particular parameterization used to calculate it. For another set of parameters f and 
metric g(f ), the following equality must therefore hold: 

gijmd^d^j = g pq {bdk p dl« . (6.18) 

Making use of the chain rule to express the differentials dl; in terms of differentials c/£, 

■ • - 3 3f« • • 

gij^d^d^ = ~g pq {H) JjJ^]d!;'di;J . (6.19) 

Since this result holds for any choice of differentials dtj, we find a relation between the 
metric in | and f coordinates: 

- 3|^3|9 

gij(.S) = g Pq (.S)^ r ^j. ( 6 . 20 ) 

Making use of the definition of M below (6.8), we rewrite the above equation as 

— 8pq M pi Mqj = (M )ipgpq M q j . (6.21) 

In matrix notation, the right-hand side is the product of three matrices. Taking the 
determinant and using the notation in (6.16) gives 

g = (det M t ) g (det M) — g (det M) 2 . 



(6.22) 
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Taking a square root 

V^=7l|detM| ; (6.23) 

we obtain the transformation property for the square root of the determinant of the metric. 

We are finally ready to appreciate the reparameterization invariance of (6.17). Making 
use of (6.7), (6.23), and (6.9) we have 

J d$ 1 d$ 2 Jg = J dl l dk 2 \teLM\yfi\&AM\ = J (6.24) 

which proves the reparameterization invariance of the area functional. To the trained 
eye the area formula in (6.17) is manifestly reparameterization invariant. That is, once 
you know how metrics transform, the invariance is reasonably simple to establish. No 
cumbersome calculation is necessary. 

Quick calculation 6.2 Consider the equation 3£ ! /d^ — S'j and use the chain rule to show 
the matrix property 

MM = 1 . (6.25) 

Show that MM — 1 holds as well. Finally, note that detMdetM = 1, a result stronger 
than the one we proved in (6.9). 



6.3 Area functional for spacetime surfaces 



Let us now move to our case of interest, the case of surfaces in spacetime. These surfaces 
are obtained by representing in spacetime the history of strings, in the same way as a 
spacetime world-line is obtained by representing the history of a particle. For the case of 
strings, we obtain a two-dimensional surface called the world-sheet of the string. Spacetime 
surfaces, such as string world-sheets, are not all that different from the spatial surfaces we 
considered in the previous section. They are two-dimensional and require two parameters. 
Instead of calling the parameters §* and £ 2 , we give them special names: r and a . 

Given our usual spacetime coordinates x ,L — (x°, x 1 , . . ., x d ), the surface is described 
by the mapping functions 

x /i (r, ct), (6.26) 



which take some region of the (r, er) parameter space into spacetime. Following a standard 
convention in string theory, we change the notation slightly. We will denote the above 
mapping functions with the capitalized symbols 

X^{x ,a). (6.27) 



We are not changing the meaning of the functions. Given a fixed point (r, er) in the 
parameter space, this point is mapped to a point with spacetime coordinates 

(X°(r, a ) , X\x, a), . . ., X d (r, er)). 



(6.28) 
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a 



da □ 
dr 

T 




Left: the parameter space (t, a), with a little square selected. Right: the surface in target 
spacetime with the image of the little square, a parallelogram whose sides are the vectors 
dv!f and dv^. 



Why do we capitalize the XI Suppose we used the same symbol to denote spacetime coor- 
dinates and mapping functions. Then we could still distinguish between them by writing 
x IA or x ii (t. a), but we would not have the luxury of dropping the (r, a) arguments. On the 
other hand, with X ^ we can drop the (r, a) arguments and still know that we are talking 
about the mapping functions of the string. We will call X ^ the string coordinates. 

As before, the parameters r and o can be viewed as coordinates on the world-sheet, 
at least locally. The map inverse to X 1 ' takes the world-sheet to the parameter space, and 
locally it assigns to each point on the surface two coordinates: the values of the parameters 
r and cr. Introducing some potential for confusion, physicists also use the term world- 
sheet to denote the two-dimensional parameter space whose image under X M gives us 
the . . . world-sheet! Unless explicitly stated, we will reserve the use of the term world- 
sheet for the spacetime surface. In Figure 6.4 we consider an open string: to the left, you 
see the parameter space surface and, to the right, you see the spacetime surface. In this 
parameter space, a ranges over a finite interval, while r may extend from minus infinity 
to plus infinity. The parameter r is roughly related to time on the strings - much more 
on this later - and the parameter a is roughly related to positions along the strings. The 
world-lines of the string endpoints have constant a , so they are parameterized by r . As r 
flows, time must flow. Thus, at least at the endpoints 



9X° 

dr 



/ 0 . 

endpoint 



(6.29) 



We will assume that this also holds for other values of a . 



To find the area element we proceed as in the case of the spatial surface, this time using 
relativistic notation. The situation is illustrated in Figure 6.4. A little rectangle of sides 
dr and do in parameter-space becomes a quadrilateral area element in spacetime. This 
quadrilateral is spanned by the vectors dv ^ and d Vj . Furthermore, 



dvf 



d -^dr, d^ = ^do, 



dr 



da 



(6.30) 
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which are analogous to our earlier spatial formulae (6.2). We can now use the analog of 
(6.4) as a candidate for the area element dA : 

dA = y/ (dv i • dvi)(dv 2 ■ dv 2 ) — ( dv\ ■ dv 2 ) 2 , (6.31) 



where the dot is the relativistic dot product. Using this dot product guarantees that the area 
element is Lorentz invariant: it is a proper area element. We wrote a question mark on top 
of the equal sign because there is one problem. Even though this is not obvious to us yet, 
the sign of the object under the square root is negative. To be able to take the square root we 
must exchange the two terms under the square root. This change of sign has no effect on 
the Lorentz invariance. Doing this, and using (6.30), we find that the proper area is given as 



-/ 



A = drda 



/ dXf- 


/ dXt 1 dX„s 


/dX v 


dX v \ 


v 3r da J 


V 3r 9r / 


V da 


da ) 



(6.32) 



Using the relativistic dot product notation. 



-f 



A = I drda 



(dX 


dX\2 


( dXy 


- / dX \ 2 


V dr 


da) 


V 9r / 


V 9(7 / 



(6.33) 



To understand why the above sign is correct we must convince ourselves that the expression 
under the square root is greater than or equal to zero at any point on the world-sheet of a 
string. 



What characterizes locally the spacetime surface traced by a string? The answer is quite 
interesting. Consider a point on the world-sheet and the set of all vectors tangent to the 
surface at that point. These vectors form a two-dimensional vector space. We claim that 
in this vector space there is a basis made by two vectors, one of which is spacelike and 
one of which is timelike. This implies that at each point on the world-sheet there are both 
timelike and spacelike tangent directions. There is a small caveat: on each fixed-time string 
there can be a finite set of exceptional points where the tangents to the world-sheet do not 
include a timelike vector. At those points, as we will see, the string moves with the speed 
of light. 

The string we are trying to define cannot have finite size pieces moving with the speed 
of light. In such case, a string would contain a continuous set of points, not just a finite 
set, where the tangents to the world-sheet do not include timelike vectors. Imagine, for 
example, a straight piece of string along the x axis moving with the speed of light in the 
y direction. At any point on this string all vectors tangent to the world-sheet are spacelike, 
except for null vectors in one particular direction. To see this consider Figure 6.5 where we 
show this string at various closely separated times. Tangent vectors at P are well approx- 
imated by vectors that joint the event P to a nearby event P' on the world-sheet. Since P 
and P' are points on the same string at different times, the spatial part of the tangent vector 
appears in the figure as the arrow joining P to P' . Consider all arrows joining P to points 
on a semicircle around P. Any world- sheet tangent direction at P is represented by one 
of these arrows. The world-sheet tangent vector associated with P Q is clearly spacelike, 
since P and Q occur at the same time. The typical tangent, that associated with the arrow 
PR is still spacelike: in the elapsed time P can get to R moving at the speed of light, but to 
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A string along the x direction moving with the velocity of light along the y direction. This is 
not allowed motion. All tangent vectors to the world-sheet at any point P on the string are 
either spacelike or null. 

get to R it must move faster than light. All tangent directions are spacelike, except for that 
associated with PS, which is null. This shows that there is no timelike tangent at P. Since 
P is a generic point along the original piece of string, nowhere on this piece of string is 
there a timelike vector tangent to the world-sheet. 

At any point on a world-sheet a spacelike direction exists and is easy to visualize: if you 
take a photograph of the string at some time, every tangent vector along the length of the 
string points in a spacelike direction. Indeed, in your frame, the events defining the string 
are simultaneous but spatially separated. 

To appreciate the need for a timelike vector at regular points on the world-sheet, consider 
first the world-line of a point particle. The tangent vector to the world-line is timelike. At 
each point on the world-line this tangent vector can be used to describe an instantaneous 
Lorentz observer who sees the particle at rest. A spacelike tangent vector to the world-line 
is unphysical: it describes a particle moving faster than the speed of light. Strings are a little 
more subtle since there is no way to tell how individual points on them move. As we shall 
make abundantly clear, the string is not made of constituents whose position we can keep 
track of (exception: one can keep track of the endpoints of an open string). Still, a timelike 
tangent to the world-sheet at a given point of a string allows us to describe an instantaneous 
Lorentz observer who sees the point at rest. If there is a timelike tangent at a point, there 
are many, by continuity. Each of these timelike tangents defines a different instantaneous 
Lorentz observer who sees the point at rest. This is consistent with our inability to track 
unambiguously the motion of points on a string. If we watch a string at two closely sepa- 
rated times we cannot tell which point went where, but for each point p on the final string 
we must be able to find some point p' on the initial string that could reach p moving with 
speed less than or at most equal to c. 

The existence of both timelike directions and spacelike directions at any regular point 
on the world-sheet is our criterion for physical motion. It guarantees that equation (6.33) 
makes sense. 

Claim: At any point P on the world-sheet where there is both a timelike direction and a 
spacelike direction, the quantity under the square root in (6.33) is positive: 
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> 0 . 



(6.34) 
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Proof: Consider the set of tangent vectors v IL (X) at P obtained as: 



v»(X) 



dx p 

dr 



+ X 



dX * 

da 



(6.35) 



where X e (— 00 , 00 ) is a parameter. Since dX^ /dr and dX^/da are linearly independent 
tangent vectors, when we vary X we get, up to constant scalings, all tangent vectors at P, 
including dX 11 /da, which is obtained in the limit X — > 00 (Figure 6.6). Constant scalings 
of a vector do not matter to decide if a vector is timelike or spacelike. To determine if v 1 ' ( X ) 
is timelike or spacelike, we consider its square: 



9 „ WdX\2 /HA iJA\ / dX \ 

" m x (77) + 2 i (a 7 ' a?) + ( 77 ) 



dx dX\ /dX \2 



(6.36) 



The dot products appearing on the final right-hand side are just numbers, so we have a 
quadratic polynomial in X. To have both timelike and spacelike tangent vectors at P , v 2 (X) 
must take both negative and positive values as we vary X. In other words, the equation 
v 2 (X) — 0 must have two real roots. For this to happen, the discriminant of the quadratic 
equation v 2 (X) — 0 must be positive. From (6.36) we see that this requires 
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dX\2 
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' 9it / 
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(6.37) 



which is precisely the condition (6.34) we set out to prove! 



Since we always have spacelike tangents, the plot of v 2 (X) in Figure 6.6 must include 
a region where i> 2 (k) > 0. Consider a point P on the world-sheet where all tangent direc- 
tions are spacelike with the exception of one that is null. Then v 2 (X) > 0, except for one 
value of X where v 2 vanishes. The equation v 2 (X) = 0 must have a single root and the 
associated discriminant is zero. It follows that the quantity under the square root in the 
action (6.33) is zero at P . Any possible motion of the string at P must be associated with 
a world-sheet tangent at P. Since motion along spacelike directions is unphysical, only 
the null vector provides an acceptable answer: the string is moving with the speed of light 
at P . 




9X" + 

dr 



a3X" 

da 




Left: a set of tangent vectors v(k) at a point P on the world-sheet. Right: a plot of v 2 (k) as 
a function of X. The vector v(A.) may be spacelike or timelike depending on the value of X. 
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6.4 The Nambu-Goto string action 



Now that we are sure that the proper area functional in (6.33) is correctly defined, we can 
introduce the action for the relativistic string. This action is proportional to the proper area 
of the world-sheet. To have the units of action we must multiply the area functional by 
some suitable constants. 

The area functional in (6.33) has units of length-squared, as it must be. This is because 
has units of length, and each term under the square root has four X . The units of r 
and a cancel out. Each term in the square root has two a derivatives and two r derivatives. 
Their units cancel against the units of the differentials. Nevertheless, we will take a to have 
units of length and r to have units of time. We do this anticipating a relation between r 
and time and between a and positions on strings. To summarize: 

[t] = T, [a ] = L, [X fl ] = L, [A] = L 2 . (6.38) 

Since S must have units of ML 2 /T and A has units of L 2 , we must multiply the proper 
area by a quantity with units of M/T. The string tension Tq has units of force, and force 
divided by velocity has the desired units of M/T . We can therefore multiply the proper 
area by Tq/c to get a quantity with the units of action. Making use of (6.33) we set the 
string action equal to 






l f dr s: daJ(X ■ X') 2 - ( X) 2 (X O 2 . (6.39) 



Here esq > 0 is some constant, and we have introduced some notation for derivatives: 



X 11 = 



dX^ 

3r 



X^' = 



8Xf 1 
3 a 



(6.40) 



Of course, we have not yet confirmed that the symbol Tq in the string action has the precise 
interpretation of tension, but we will do so in Section 6.7. We will also confirm there 
that the overall negative sign which multiplies the action is correct. The action S is the 
Nambu-Goto action for the relativistic string. 

It is crucial that this action be reparameterization invariant. We can proceed just as we 
did with spatial surfaces to write the Nambu-Goto action in a manifestly reparameteriza- 
tion invariant way. In this case we have 



dX^ dX v 

- ds 2 = dX^dX^ = m.v dX»dX v = n ltv -—— w d\/ a d^ . (6.41) 

3§“ 3§ p 



Here r]^ v is the target-space Minkowski metric. The indices a and f J > run over two values, 
1 and 2, and we have taken f 1 = r, £ 2 = a. Just as we did for spatial surfaces, we define 
an induced metric Yap on the world-sheet: 



dX * dX v _ dX dX 

~d^ ~W l W' 



Yap = *1ii 



(6.42) 
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More explicitly, the 2-by-2 matrix y a p is 



( X ) 2 X ■ X ' 



Y<*P ~ , 

_ X ■ X' (X') 2 



( 6 . 43 ) 



With the help of this metric we can write the Nambu-Goto action in the manifestly 
reparameterization invariant form 



The analysis in Section 6.2 of reparameterization invariance for spatial surfaces holds, 
without change, in the present case. Not only is the action (6.44) manifestly reparameteri- 
zation invariant, it is also more compact than (6.39). In this form, one can readily generalize 
the Nambu-Goto action to describe the dynamics of objects that have more dimensions 
than strings. An action of this kind is useful as a first approximation to the dynamics of 
D-branes. 



6.5 Equations of motion, boundary conditions, and D-branes 



In this section we will obtain the equations of motion that follow by variation of the string 
action. In doing so we will also have an opportunity to discuss the various boundary con- 
ditions that can be imposed on the ends of open strings. Dirichlet boundary conditions will 
be interpreted to arise owing to the existence of D-branes. 

Let us begin by writing the Nambu-Goto action (6.39) as the double integral of a 
Lagrangian density £: 




( 6 . 44 ) 




( 6 . 45 ) 



where £ is given by 




( 6 . 46 ) 



We can obtain the equations of motion for the relativistic string by setting the variation of 
the action (6.45) equal to zero. The variation is simply 
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where we have used 

/dX^\ 3 (8X») 

8X' l = s( — ) = -L, (6.48) 

V 3r / 3r 

and an analogous equation for SX^' . 

The quantities dC/dX 11 and UC/dX' 1 ' will appear frequently throughout the remainder 
of our discussion, so it is useful to introduce new symbols for them. This is just what we 
did when we studied the nonrelativistic string in Section 4.6. This time we find 



, t = dC_ _ Tp (X • XQX; - (X 7 ) 2 ^ 

" ~ dX» ~ c y ( x . x ') 2 - (X) 2 (X') 2 ’ 

= To 

M dXV C 7(i . x ') 2 - (X) 2 (X f ) 2 ' 



(6.49) 

(6.50) 



Quick calculation 6.3 Verify equations (6.49) and (6.50). 



Using this notation, the variation SS in (6.47) becomes 



SS = 



r T f r a i 

1 dr I da 

Jii J 0 



— (SX^V*) 
dr M 



3 

da 



/ dV T 

(dX^Vl) -SX^i M 



dr 



dV^ 

da 



. (6.51) 



The first term on the right-hand side, being a full derivative in r, will contribute terms 
proportional to SX^fjf, a) and SX^fji, a). Since the flow of r implies the flow of time, 
we can imagine specifying the initial and final states of the string, and we restrict our- 
selves to variations for which SX^(rf, a) = <5A M (r,-,er) = 0. We will always assume such 
variations, so we can forget about these terms. The variation then becomes 



SS = 



r T f f r f /’"i 

/ dr [SX^V*]^ - / dr da SX» 
J Zi J Ti JO 



d_K 

dr 



dvi 

da 



(6.52) 



Since the second term on the right-hand side must vanish for all variations SX 1 ' of the 
motion, we set 



dVl dV° 

I H- 

dr da 



= 0 . 



(6.53) 



This is the equation of motion for the relativistic string, open or closed. A quick glance 
at definitions (6.49) and (6.50) shows that this equation is incredibly complicated. The 
key to its solution will lie in the reparameterization invariance of the Nambu-Goto action. 
Choosing a clever parameterization will simplify our work enormously. 

The first term on the right-hand side of (6.52) has to do with the string endpoints. It is, 
in fact, a collection of terms that includes two terms for each value of the index /x. More 
explicitly, the list is 
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J f dr^SX°(r,(7 l )Vo(r,(j l )-8X 0 (r,0)Vo(r,0) 

+ 8X\t, <n)Pf ( r, «ri) - 8X l (r, 0)Pf (t, 0) 

+ 8X d { r, rr,)PJ (r, CTl ) - <5X d (r, 0)PJ (r, 0)) . (6.54) 

We need a boundary condition for each term in the above list. This is a total of 2D — 
2 (d + 1) boundary conditions. 

Let us focus on a single term, that is, we fix /i and select one endpoint. Let a * denote the 
a coordinate of an endpoint; a* can be equal to zero or equal to cri . Selecting an endpoint 
means fixing the value of er*. As before, there are two natural boundary conditions that 
one can impose at an endpoint. The first is a Dirichlet boundary condition, in which the 
endpoint of the string remains fixed throughout the motion: 



Dirichlet boundary condition: 



dX » 

— — (r, cr*) = 0, 
3r 



M # 0. 



(6.55) 



Since time varies as r varies (see (6.29)), the value /i — 0 must be excluded. Dirichlet 
boundary conditions are only possible for space directions. Given that constancy in r 
means constancy in time, equation (6.55) implies that the [x, coordinate of the selected 
string endpoint is fixed in time. Alternatively, rather than requiring that the r derivative 
vanishes, we could simply specify a constant value for X^(t, er*). If the string endpoint 
is fixed, the variations are set to vanish there: 8X ll (r , er*) = 0. This guarantees that the 
relevant term in (6.5) vanishes. 

The second possible boundary condition is a free endpoint condition: 



free endpoint condition : V a (r, er*) = 0 . (6.56) 



This condition, as needed, also results in the vanishing of the relevant term in (6.5). This is 
called a free endpoint condition because it does not impose any constraint on the variation 
8X^(t, cr*) of the string coordinate at the endpoint. The endpoint is free to do whatever is 
needed to get the variation of the action to vanish. The free endpoint boundary condition 
must apply for n — 0: 



"Pq (r, <ti) = (r, 0) = 0 . 



(6.57) 



For the nonrelativistic string, the free endpoint boundary condition implies the vanishing 
of V x , which imposes a Neumann boundary condition on the string coordinate (see (4.47)). 






6.5 Equations of motion, boundary conditions, and D-branes 




A D2-brane stretched over the (x 1 , x 2 ) plane. The endpoints of the open string can move 
freely on the plane, but must remain attached to it. The coordinate x 3 of the endpoints 
must vanish at all times. This is a Dirichlet boundary condition for the string coordinate X 3 . 

We will eventually understand (6.56) in terms of a Neumann boundary condition. Similarly, 
the Dirichlet boundary (6.55) will be shown to imply the vanishing of V at the string 
endpoints. 

The boundary conditions (6.55) and (6.56) can be imposed in many possible ways. For 
each spatial direction, and at each endpoint, we can choose either a Dirichlet or a free- 
endpoint boundary condition. Since closed strings have no endpoints, they do not require 
boundary conditions. 

Let us elaborate on the case of Dirichlet boundary conditions. It is clear from the study 
of nonrelativistic strings that Dirichlet boundary conditions arise if string endpoints are 
attached to some physical objects. Consider, for example, Figure 4.2. On the left, the string 
is attached to two points. On the right, the string is free to slide up and down at the end- 
points; the string endpoints are forced to stay on one-dimensional lines and horizontal 
motion of the endpoints is forbidden. The objects on which open string endpoints must lie 
are characterized by their dimensionality, more precisely, by the number of spatial dimen- 
sions that they have. They are called D-branes, where the letter D stands for Dirichlet. The 
objects which fix the string endpoints on the left side of Figure 4.2 are zero-dimensional. 
They are called DO-branes. The lines which constrain the string endpoints on the right side 
of the figure are one dimensional. They are called Dl-branes. 

A Dp-brane is an object with p spatial dimensions. Since the string endpoints must lie on 
the D p-brane, a set of Dirichlet boundary conditions is specified. A flat D2-brane in a three- 
dimensional space, for example, is specified by one condition, say x 3 = 0 (Figure 6.7). 
This means that the D2-brane extends over the (x 1 , x 2 ) plane. The Dirichlet boundary 
condition applies to the string coordinate A 3 , which must vanish at the string endpoints. 
Since the motion of open string endpoints is free along the directions of the brane, the string 
coordinates X 1 and X 2 satisfy free boundary conditions. When the open string endpoints 
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have free boundary conditions along all spatial directions, we still have a D-brane, but this 
time it is a space-filling D-brane. The D-brane extends all over space, and since open string 
endpoints can be anywhere on the D-brane, open string endpoints are completely free. 

For (quantum) relativistic strings the consistency of Dirichlet boundary conditions 
allows one to discover the properties of D-branes. D-branes are physical objects that exist 
in a theory of strings and are not introduced by hand. D-branes need not have infinite extent 
nor are they necessarily hyperplanes. They have calculable energy densities and a host of 
remarkable properties. We will study D-branes in detail beginning in Chapter 15. 



6.6 The static gauge 



To make progress in understanding the action for the relativistic string, we must param- 
eterize the world-sheet in a useful way. We are allowed to choose the parameterization 
freely because of the reparameterization invariance of the string action. Reparameterization 
invariance in string theory is analogous to gauge invariance in electrodynamics. Maxwell’s 
equations possess a symmetry under gauge transformations that allows us to use different 
potentials A to represent the same electromagnetic fields E and B. A suitable choice of 
gauge helps to uncover the physics. Similarly, we may use many different grids on the 
world-sheet to describe the same physical motion of the string. A suitable choice of grid 
can make this task much easier. A good choice of parameterization was useful even for 
the relativistic point particle - its equation of motion is simplest when the trajectory is 
parameterized by proper time. 

In this section, we will discuss only a partial parameterization on the world-sheet. We will 
fix the lines of constant r by relating r to the time coordinate X° = ct in some chosen 
Lorentz frame. 

Consider the constant time hyperplane t = to in the target space (Figure 6.8). This plane 
will intersect the world-sheet along a curve - the string at time to according to observers 
in our chosen Lorentz frame. We declare this curve to be a curve of constant r; in fact, we 
declare it to be the curve r = to. Extending this definition to all times t, we declare that for 
any point Q on the world-sheet 

r(Q) = t(Q). (6.58) 

This choice of r parameterization is called the static gauge because lines of constant r are 
“static strings” in the chosen Lorentz frame. 

We will not try to make a sophisticated choice of a at this time. For an open string, we 
will choose one edge of the world-sheet to be the curve a = 0 and the other edge to be the 
curve a — oq : 

a e [0, cti] , for an open string. (6.59) 

We draw lines of constant a on the surface quite arbitrarily, provided, of course, that con- 
stant a lines vary smoothly, do not intersect, and are consistent with the two curves which 
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a 



A 















2 - 






Left: the parameter space strip for an open string. The vertical segment AB is the line 
r = f 0 . Right: the open string world-sheet in target space. The string at time t = t 0 
is the intersection of the world-sheet with the hyperplane f = f 0 . In the static gauge, the 
string at time f = f 0 is the image of the r = f 0 segment AB. 



are the boundary of the world-sheet (Figure 6.8). Drawing lines of constant a is equiva- 
lent to giving an explicit a parameterization to all the strings. For closed strings the same 
ideas apply, but there is a significant proviso: there must be an identification in the (r, er) 
parameter space. The er direction must be made into a circle, making the (r, er) parameter 
space into a cylinder. This is needed because the closed string world-sheet is topologically 
a cylinder. Letting er c denote the circumference of the a circle, the identification is 

(r, er) ~ (r, er + er f ) . (6.60) 

Points that are identified by this relation on the parameter space map to the same point on 
the closed string world-sheet. The closed strings can be parameterized using any er interval 
of length cr e , for example 

a e [0, er c ] , for a closed string. (6.61) 

Let us now explore some implications of our choice of r. We can write (6.58) as 

X°(j, er) = c t (r, er) = c r , (6.62) 



or simply 

r — t . 

We can thus describe the collection of string coordinates X 1 ' as 
X^(t, er) = X^(t, a) = { ct , X(t, er)} , 
letting the vector X represent the spatial string coordinates. We then find 



dX^ 

dcr 

dX^ 

dr 



= 



= 



/8X° dX\ 
VlkT’ ~da ) 
/8X° 3X\ 

Ur ~dt) 



-(*£)■ 



(6.63) 



(6.64) 



(6.65) 



As you can see, this parameterization separates the time and space components quite neatly. 
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Now that we have made a choice of r coordinates, we can do a simple test to confirm that 
we got the right sign under the radical in the Nambu-Goto action (6.39). Imagine a little 
piece of string with no velocity. Because it is not moving, dX/dt = 0, and using (6.65), 
the square root in (6.39) becomes 



/ 



0 - 




(6.66) 



The quantity under the square root is positive, just as we expected. If some day you forget 
the sign under the radical in the string action, this is a good way to check it quickly. 



6.7 Tension and energy of a stretched string 



Let us now do our first calculation with the Nambu-Goto action - our first calculation in 
string theory! We are going to analyze a stretched relativistic string. The endpoints of the 
string are fixed at x 1 = 0, and at x 1 = a >0, with vanishing values for the coordinates 
of the additional spatial dimensions. We therefore denote the spatial coordinates of the 
endpoints as (0, 0) and (a, 0). The inclusion of the common (cl — l)-dimensional vector 0 
tells us that the string is only stretched along the first spatial coordinate. 

We evaluate the string action for this stretched string using the static gauge X° = cr. 
Because this is a static string stretched from x 1 = 0 to x 1 = a, we can write 

X 1 (t,a) = f(a), X 2 = X 3 = -.. = X d = 0, (6.67) 

where 

f (0) = 0 , f(<Ti)=a, (6.68) 

and the function f (a) is strictly increasing and continuous on the interval a e [0, eri]. The 
setup is illustrated in Figure 6.9. The function / must be strictly increasing to ensure that 
each point along the string is assigned a unique a coordinate. 




A string of length a stretched along the x 1 axis. The string is parameterized as 
X\t,a) = f(a). 
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It now follows that 

X^ — (c, 0, 0), X ,IJ - = (0, /', 6), (6.69) 

with f = df/do > 0. Therefore 

(X) 2 = - c 2 , (X') 2 = (f') 2 , X ■ X' = 0. (6.70) 

We can now evaluate the action (6.39): 

S = ~— f f dt [ 1 doJ 0 — (— c 2 )(/') 2 = -T 0 f f dt [ ' do — . (6.71) 

c Jt i Jo v J ti Jo do 

The o integrand is a total derivative, so 

S = -To f f dt ( f(oi ) - /(0)) = r dt (-Too ) , (6.72) 

J tj J t[ 

where we used (6.68). Note that the value of the action does not depend on the function / 
used to parameterize the string. This is an explicit confirmation of the reparameterization 
invariance of the string action. 

We would like to interpret our result. For this, recall that the action is the time integral 
of the Lagrangian L. When the kinetic energy vanishes, L = — V, where V is the potential 
energy. Since our string is static, there is no kinetic energy, so 

S = [ 1 dt(-V). (6.73) 

Jti 

Comparing this with (6.72) we conclude that 

V = T 0 a . (6.74) 

The potential energy of our stretched string is just Toa. What does this mean? If the tension 
of a static string is To, regardless of its length, then Tqci is the amount of energy that you 
must spend to create a string of length a. Imagine that you start with an infinitesimal string 
and you start pulling it. As you do work you are giving energy to the string, in fact, you are 
creating rest energy, or rest mass. The rest mass no per unit length is 

9 V T 0 

Mo c 2 = — = T 0 — > MO = — ■ (6.75) 

a c- 

The mass (or rest energy) arises only because the string has a tension. Because of this, 
the relativistic string is sometimes referred to as a massless string. The above calculation 
supports the identification of To as the string tension. It also confirms that the minus sign 
in front of the action (6.39) is necessary - otherwise the potential energy of the stretched 
string would have come out negative. 

There is one point that we have glossed over. We assumed in our analysis that the config- 
uration (6.67) satisfies the string equations of motion. If it does not, then the configuration 
cannot be physically realized. Let us check that the equations of motion are satisfied. 
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First note that on account of (6.69) neither X 1 ' nor X 11 ' has r dependence. Therefore 
neither V T nor V n has r dependence (see (6.49) and (6.50)). This being the case, the 
equation of motion (6.53) reduces to 



dVl 

3 a 



= 0 . 



( 6 . 76 ) 



This requires that V a be a -independent. We look again at (6.50) and use (6.70) to find 



-p or 
r M 



To ^ 

c y/c 2 (f ') 2 ° f ' 



( 6 . 77 ) 



This is nonvanishing only for /z = 1, in which case Xj = /', so 'P fI is indeed ct- 
independent. Thus the equation of motion is satisfied. Even the boundary conditions are 
satisfied. As we discussed in Section 6.5, there is no condition to check for string coordi- 
nates that satisfy Dirichlet boundary conditions at the endpoints. In our problem this means 
that there are no extra conditions to be checked for any of the spatial coordinates. For the 
zeroth coordinate, equation (6.57) requires the free boundary condition Vq = 0 . This holds 
on account of (6.77). 



6.8 Action in terms of transverse velocity 



We have chosen a partial parameterization of the world-sheet by imposing the condition 
X° = ct = cr . With this choice, a line of constant r on the world-sheet corresponds to the 
string, as seen by our chosen Lorentz observer, at the particular time t = r. 

Can we define some sort of string velocity? Since the components of X(t, ct) are the 
string spatial coordinates, the derivative 3 X /3 1 seems to be the closest thing we have to a 
velocity. This velocity, however, depends upon the choice of a . Its direction, for example, 
goes along the lines of constant a . Since a can be chosen quite arbitrarily, keeping a 
constant in taking the derivative is clearly not very physically significant! 

Fixing physically the a parameterization of a string is subtle because the string is an 
object with no substructure. When comparing a string at two nearby times, it is not possible 
to say that a point moved from one location to the next. To speak of points on the string we 
need a a parameterization, and reparameterization invariance makes it clear to us that this 
parameterization is not unique. This suggests that longitudinal motion on the string is not 
physically meaningful. 

There is a reparameterization invariant velocity that can be defined on the string. This is, 
however, a transverse velocity. We consider the string motion in space, and imagine that 
each point on the string moves transversely to the string (Figure 6.10). Consider a string at 
some fixed time t and pick a point p on it. Draw the hyperplane orthogonal to the string at 
p. At time t + dt, with dt infinitesimal, the string has moved, but it will still intersect the 
plane, this time at a point p' . The transverse velocity is what we get if we presume that the 
point p moved to p'. No string parameterization is needed to define this velocity. 
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A string at time t and the plane orthogonal to the string at p. At time t+ dt the string 
intersects the plane at p' . To define the transverse velocity we assume that p moved to p' . 



When speaking of evolving strings there are two surfaces we can discuss. One is the 
world-sheet, the surface in spacetime which represents the history of the string. The other 
is a surface in space. This spatial surface is put together by combining the strings that 
we observe at all times. This is the surface that would be generated if the string were 
to leave a wake as it moved. The transverse velocity v± at any point on the string is a 
vector orthogonal to the string and tangent to the string spatial surface. Since r3j_ is a string 
reparameterization invariant notion of velocity, we expect it to enter naturally into the 
evaluation of the string action. 



In order to define the transverse velocity vj_, it is useful to have a unit vector tangent to 
the string. To this end, we now introduce a parameter s which is more physical than our 
nearly arbitrary a : s measures length along the string. Let us work with the string at a fixed 
time, and define s(a) to be the length of the string in the interval [0, a], Thus, for example, 
s(0) = 0, and s(a\ ) is the length of an entire open string. Since ds is the length of the 
infinitesimal vector dX which arises from an interval dcr along the string, we have: 



ds = \dX\ = 



dX 

da 



\da\ . 



(6.78) 



Now consider the quantity dX/d s, which is the rate of change of X with respect to the 
length of the string. First note that it is a unit vector: 



dX dX _ dX dX/da\2_ dX 
ds ds da da V ds ) da 

The derivative dX/da is taken with t held fixed, so it lies along a line of constant t. Since 
the lines of constant t are precisely the strings, it is tangent to the string. In addition 



b = 1 • <6 - 79 > 



dX _ dX da 
ds da ds 

and thus dX/ds is also tangent to the string. Because it has unit length. 



(6.80) 



dX 

ds 



is a unit vector tangent to the string . 



(6.81) 
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A small piece of the world-sheet showing the vector 3X/3f, the transverse velocity and 
the unit vector dX/ds. 



We define v±_ to be the component of the velocity dX/dt, in the direction perpendicular 
to the string (see Figure 6.11). For any vector u, its component perpendicular to a unit 
vector n is u — (u ■ n)n. Therefore, using our unit vector dX/ds along the string, we have 



_ _ dx /dx dx\dx 
v± ~ ~di~ \~dT " 37 / 77 ' 

For future use, we calculate i>7 . A small computation gives 



(dX\ 2 


/dX 


3X\ 


v"37/ 


v"37 ' 


' 37/ 



(6.82) 



(6.83) 



Our goal now is to write the string action in terms of u_l and other quantities, if necessary. 
Using the static gauge r — t, and equations (6.65), we find 



(X) 2 = -c 2 + ( : 



, 7 / 3X\ 2 

< x > = (sr) ■ 



/ 3X\ 2 

With these relations we simplify the argument of the square root in the string action: 



, dx dx 

XX' = . 

3 1 3 a 



(6.84) 



(X ■ X') 2 



■ , . , /dX dX\2 r , / 3X\2 ■ 

m tX ) - = (-* ■ d + P - ( ar) 

/ds\2 w dX dX\2 



\da 



)[( 



3 1 ds J 



+ c 



3 1 
2 



/dX\2 
V3 a) 
/dX \2 



(6.85) 



The terms on the right-hand side above can be neatly expressed in terms of vj_. Making 
use of (6.83), 



(X ■ X') 2 - (X) 2 {X') 2 = (^-) 2 (c 2 - «i) , 



(6.86) 
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or, alternatively, 




(6.87) 



This simple expression for the string Lagrangian density shows that u_l is a natural dynami- 
cal variable. Moreover, the longitudinal component of the velocity is completely irrelevant. 
Now we can write the string action as 



Here ds/da = \dX/do\. We did not cancel the dc r because it is typically useful to have 
an integral over a fixed parameter range. While the range of a is constant, the total length 
of a string is time dependent. We introduced s as the function of a that gives length along 
the string at a fixed time. This definition, used at different times, endows .s' with a time 
dependence that can be relevant if we compare strings at different times. 

The associated Lagrangian is given by 



This formula was written as an integral over the length parameter in order to give an 
interpretation. For each infinitesimal piece of string. Tods is its rest energy. As a result, the 
Lagrangian is an integral over the string of (minus) the rest energy times a local relativis- 
tic factor. In this form, we recognize (6.89) as the natural generalization of the relativistic 
particle Lagrangian (5.8). 

The action (6.88) is valid both for open strings and for closed strings. Although relatively 
simple, it still leads to rather complicated equations of motion in all but the most symmet- 
rical situations. In order to obtain simple equations of motion, we will have to be clever 
in our choice of a . For open strings, in addition, we must understand how the endpoints 
move. 

We conclude this section by simplifying our expressions (6.49) and (6.50) for V T/i and 
V n ) ' in the static gauge. Let us begin with V alL . Its denominator is given in (6.87) and its 
numerator is simplified using relations (6.84). We find 




( 6 . 88 ) 




(6.89) 




(6.90) 
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Bringing the ds/da from the denominator up to the numerator, we can turn derivatives 
with respect to a into derivatives with respect to s: 



= _ 



T 0 



dx dx 

ds d t 



X 11 + I c 2 



dX 



8X' 1 



' 1 - 



(6.91) 



The fi — 0 component of this quantity simplifies considerably. Since X° — c and 
dX°/ds = c dt /ds = 0, we find 



■pc r0 



dX dX 
d s dt 






A rather similar calculation for V TI ‘ gives 

X* * 

„ = 7o ^ 

c 2 da 

Quick calculation 6.4 Prove (6.93). 

It follows from (6.93) that V z0 and V T are given by 



dX dX\ dX * 
ds dt ) ds 






r r 0= 7 /0^_ 

c da 



1 



1 - 

1 ^2 



- = 7b ds_ v± 
c 2 da 



1 - -A 

1 r- 



(6.92) 



(6.93) 



(6.94) 



6.9 Motion of open string endpoints 



We will now analyze the motion of the endpoints of an open relativistic string. We consider 
endpoints that are free to move in all directions. Given our discussion in Section 6.5, this 
means that we have a space-filling D-brane. Free endpoints are specified by the boundary 
conditions (6.56), which require the vanishing of V" at the endpoints. We will discover 
two important properties of the free motion of open string endpoints. 

• The endpoints move with the speed of light. 

• The endpoints move transversely to the string. 

On the interior of the string the notion of a velocity was ambiguous. For the string end- 
points, however, the velocity is well defined - there is no ambiguity defining the velocity of 
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points! Therefore, our statements about endpoint motion have content. In the second state- 
ment, motion transverse to the string means that the velocity of an endpoint is orthogonal 
to the tangent to the string at the endpoint. 



To prove the above properties we first recall that V a0 must vanish at the endpoints. Using 
(6.92) and noting that the square root in the denominator cannot be infinite, we deduce that 
the numerator must vanish: 

dX dX 

— • — = 0 at the endpoints . (6.95) 

ds 3 1 

Since dX/ds is a unit vector tangent to the string, and dX/dt is the endpoint veloc- 
ity, this equation proves that the endpoints move transversely to the string - one of our 
two claims. In agreement with this interpretation, using (6.95) in (6.82) we see that at 
the endpoints v±_ = dX/dt = v. Equation (6.95) actually allows for vanishing endpoint 
velocity, in which case the transversality property would be trivially satisfied. But this 
cannot happen; the endpoints move with the speed of light, as we now show. 

Using (6.95), we simplify the expression (6.91) for V al ' at the endpoints: 




3Z M 

~aT 



at the endpoints. 



For the space coordinates, /x = \ ..... d. equation (6.96) gives 



(6.96) 



/ v 2 926 

V — —ToJ 1 r — =0 at the endpoints. 

V c 1 ds 



Since dX/ds is a unit vector, we conclude that 



c 2 . 



(6.97) 



(6.98) 



Free open string endpoints move with the speed of light. 

This conclusion implies that the denominator in (6.92) actually vanishes at the endpoints. 
Equation (6.95) must still hold, otherwise V a0 would diverge, rather than vanish at the 
endpoints. In fact, as we approach the string endpoints, the numerator in (6.92) must vanish 
faster than the denominator to ensure that the ratio goes to zero. 



Problems 



Problem 6.1 The induced metric on a two-dimensional surface. 

A two-dimensional surface in flat three-dimensional space is described by the height func- 
tion j = h (x , y ), so that points on the surface take the form (x, y, h (x , y)). This surface is 
naturally parameterized by x and y. Recall (6.14), which gives the metric gij( f) induced 
on a surface parameterized by (£*, £ 2 ). 

(a) Calculate the components of the metric gjj in terms of h and its derivatives. Write a 
formula for gjj of the form g, j = 8 U H ■ 
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(b) Write the fully simplified appropriate form of the area integral (A = f d^d^^/g). 
Problem 6.2 Metric on S 2 from stereographic parameterization. 

Consider a unit sphere S 2 in R 3 centered at the origin: x 2 + y 2 + z 2 = 1. Denote a point 
on the sphere by x — (x, y, z). In the stereographic parameterization of the sphere we use 
parameters £* and § 2 and points on the sphere are (see (6.1)): 

x (g \ ? 2 ) = (*(£\£ 2 ) , $ 2 ) , z{S\ ? 2 )) . 

Given parameters (£ J , £ 2 ), the corresponding point on the sphere is that which lies on the 
line that goes through the north pole N = (0, 0, 1) and the point (£ 1 . f 2 , 0). Note that the 
north pole itself is not attained for any finite values of the parameters. 

(a) Draw a sketch for the above construction. What are the required ranges for f 1 and § 2 
if we wish to parameterize the full sphere (except for the north pole)? 

(b) Calculate the functions x(^\ f 2 ), y(§ J , § 2 ), and z (£ l , £ 2 ). 

(c) Calculate the four components of the induced metric gij(^). This is the metric on the 
sphere, described using the £ parameters. The algebra is a bit messy, but the result is 
quite simple (use a symbolic manipulator!). 

(d) Check your result by computing the area of the sphere using (6.17). 

Problem 6.3 Schwarz inequality in R 11 . 

Consider a two-dimensional vector space V with a constant metric such that there is a 
timelike vector t' ( t ' 2 = t'-t'< 0) and a spacelike vector s' (s' 2 = s' ■ s' > 0). 

(a) Show that you can construct vectors t and s such that t ■ t = — l, s • s = 1, and t ■ s = 

0. [Hint: choose t to be in the direction of t' .] The vectors t and .v provide a canonical 
basis for V, which is now identified as the space R 11 . 

(b) Consider now two arbitrary vectors v\ and Vi in V. Use the basis in (a) to prove that 

(ih ■ v 2 ) 2 > v 2 vj , 

where the equality only holds if and only if the vectors v\ and v 2 are parallel. This 
result gives some additional perspective on our proof of (6.34). 

Problem 6.4 Stretched string and a nonrelativistic limit. 

Examine the action (6.88) for a relativistic string with endpoints attached at (0, 0) and 
(a, 0), as in Section 6.7. Consider the nonrelativistic approximation where | uj_ | <<£ c and 
the oscillations are small (see (4.3)). You may denote by y the collection of transverse 
coordinates X 2 , . . ., X d and write y(t, x), where x is the coordinate corresponding to X 1 . 
Explain why the following relations hold: 

2 2 _* d y 

ds — dx + dy ■ dy , v\——. 

dt 

Show that the action reduces, up to an additive constant, to the action for a nonrelativistic 
string performing small transverse oscillations. What are the tension and the linear mass 
density of the resulting string? What is the additive constant? 
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Problem 6.5 Alternative derivation of the nonrelativistic limit. 

Consider the same setup and nonrelativistic approximation discussed in Problem 6.4. This 
time, however, start your analysis with the Nambu-Goto action (6.39) and work in the static 
gauge. Moreover, parameterize the strings using X 1 = x = aa /o\. This parameterization 
is allowed for small oscillations. In fact, it is allowed for any motion in which X 1 is an 
increasing function along the string. 

Problem 6.6 Planar motion for open string with attached endpoints. 

Consider the motion of a relativistic open string on the ( x , y) plane. The string end- 
points are attached to (x, y) = (0, 0) and (x, y) = (a, 0), where a > 0. We want to study 
motions that can be described using the function y(t, x) which gives the vertical displace- 
ment of the string for x e [0, a]. Show that the Lagrangian (6.89) can be written in the 
form: 

L = -T 0 f 
Jo 

Here y' — dy/dx and y = dy/dt. 

Problem 6.7 Time evolution of a closed circular string. 

At t — 0, a closed string forms a circle of radius R on the (x, y) plane and has zero velocity. 
The time development of this string can be studied using the action (6.88). The string will 
remain circular, but its radius will be a time-dependent function R(t). Give the Lagrangian 
L as a function of R(t) and its time derivative. Calculate the radius and velocity as func- 
tions of time. Sketch the spacetime surface traced by the string in a three-dimensional plot 
with x, y, and ct axes. [Hint: calculate the Hamiltonian associated with L and use energy 
conservation.] 

Problem 6.8 Covariant analysis of open string endpoint motion. 

Use the explicit form of V a to calculate and use the result of this calculation to 

prove that free open string endpoints move with the speed of light. 

Problem 6.9 Hamiltonian density for relativistic strings.^ 

Consider the string Lagrangian density C in the static gauge and written in terms of X 
and d t X. Show that the canonical momentum density V(t, a) is given by 

^ dC Tq u_l ds 

V(t,a) = — = — . — - 

3(9 r X) c f~v l da 

Calculate the Hamiltonian density 7 f, again in terms of u_l and 4^. Write the total Hamil- 
tonian as H — f doTi = f ds (. . .) and show that your answer is consistent with the 
interpretation that the energy of the string arises as the energy of transverse motion of 
a string whose rest mass arises solely from the tension. 
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Problem 6.10 Circular string in de Sitter space. 

The Nambu-Goto action on a curved spacetime with metric g/j, v (x) can be written as in 
(6.39) with the convention that all dot products use the metric g^ v : 



X-X' = 8liV (X) X*X V ', (X) 2 =g llv (X)X' 1 X v , (X') 2 = gflv (X)X ,>l X' v . 



Consider strings in an expanding four-dimensional de Sitter spacetime, for which the 
metric g^ v can be taken to be diagonal, with values 



The Hubble constant H has units of one over time so that lit is dimensionless. 

(a) Assume X° = ct = c r and write X 1 ' = { X {) , X}. Write the Nambu-Goto action in 
terms of t and a derivatives of X. 

(b) Consider now a circular string on the (x 1 , x 2 ) plane, namely 

X l (t, cr) = r(t) cos a , X 2 (t, a) = r(t) sin er , a e [0, 2it] , 

where r(t) is a radius function to be determined. Use this ansatz to simplify the string 
action and perform the integration over a . Write your result as 



and determine L(r(t), r(t): t ), which is explicitly time dependent. Because of the e 2Ht 
factors in the metric, the physically measured radius of the string is actually R(t ) = 
e Ht r(t). Write the Lagrangian in terms of R and R. 

(c) Consider strings with constant R and use the Lagrangian to give the potential V (R) for 
such strings. Plot this potential and verify that it is well-defined only if R < c/H. Find 
a critical point of the potential and the corresponding value of R. Is this static string in 
stable equilibrium? 

(d) Use the Lagrangian for R and R to calculate the corresponding Hamiltonian, expressed 
as a function of R and R. Simplify your answer. Is this Hamiltonian function conserved 
for physical motion? 

Problem 6.1 1 Open strings ending on D-branes of various dimensions. 

Consider a world with d spatial dimensions. A Dp-brane is an extended object with p 
spatial dimensions: a /^-dimensional hyperplane inside the d-dimensional space. We will 
examine properties of strings ending on a Dp-brane, where 0 <P< d. The case p = d, 
where the D-brane is space filling was discussed in Section 6.9. 

For a Dp-brane, let x l , with i = 1 , . . ., p, correspond to directions on the Dp-brane, and 
x a with a = p + 1 ..... d, correspond to directions orthogonal to the Dp-brane. The Dp- 
brane position is specified by x a =0, for a = p + 1, . . ., d. Open string endpoints must 
lie on the Dp-brane. Focusing on the o = 0 endpoint, we thus have 



goo = -1 , gu—g22 = g33 = e‘ 



,2 Ht 




X a (t,a— 0) = 0 , a = p + 1, . . ., d . 
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The motion of the endpoint along the D-brane directions x‘ is free. We work in the static 
gauge. 

(a) State the conditions satisfied by Vq , V® , and V" at the endpoint (no condition is a 
possibility!). 

Prove the following. 

(b) All boundary conditions are automatically satisfied if the string ends on a DO-brane. 

(c) For a string ending on a Dl-brane, the tangent to the string at the endpoint is orthogonal 
to the Dl-brane, and the endpoint velocity is unconstrained. 

(d) For a string ending on a Dp-brane, with p > 2, there are two possibilities: 

(i) the string is orthogonal to the Dp-brane at the endpoint, and the endpoint velocity 
is unconstrained, or, 

(ii) the string is not orthogonal to the Dp-brane at the endpoint, and the endpoint 
moves with the speed of light transversely to the string. 
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motion 



r 



We construct lines of constant a that are perpendicular to the lines of constant 
r and use the energy density carried by the string to fix the a parameterization 
completely. The resulting system of string equations of motion includes wave 
equations and two nonlinear constraints. We find the general solution which 
describes the motion of open strings with free endpoints. We examine the free 
evolution of closed strings and find that cusps that appear and later disappear 
are generic. We discuss cosmic strings and the gravitational lensing they can 
produce. 



7.1 Choosing a a parameterization 



We have already learned a few facts about the motion of relativistic strings. In particular, 
we learned that free open string endpoints move with the speed of light transversally to the 
string. This result was obtained using the static gauge X° = ct = cr, which partially fixed 
the parameterization of the world-sheet. Once we have chosen this gauge, the string motion 
is defined by the functions X(t, a). As we vary t and a, X(f, a) describes the string spatial 
surface - the surface in space consisting of the strings at all times. In this chapter, the string 
spatial surface will simply be called the string surface. We will also use the static gauge 
throughout. The statement that the open string endpoints move transversely to the string 
implies that the vectors tangent to the boundary of the string surface are orthogonal to the 
strings. Our goal is to find a useful a parameterization of the string surface. If we know 
this parameterization, we also know the a parameterization of the world-sheet and, in fact, 
the complete parameterization of the world-sheet. 

We will now show how to use a particular a parameterization of a single string to construct 
a useful a parameterization of all strings, and thus of the entire string surface. Suppose that 
the t = 0 string is given some cr parameterization with cr e [0, crj] (see Figure 7.1). Now, 
consider the string at t — e, with e infinitesimal. On the string surface we can draw short 
segments perpendicular to the t = 0 string. Let these segments intersect the t = e string. 
Consider a point cro on the t — 0 string, and the short perpendicular above it. We declare 
the intersection of this perpendicular with the t = e string to also have cr = cro. We do this 
all over the t — 0 string, obtaining a parameterization of the t = e string. We then repeat 
this procedure, using the t = e string to parameterize the t — 2e string. We continue in this 
way, working in the limit of very small e. The result is a set of lines of constant a that are 
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Using the parameterization of the f = 0 string to construct the parameterization of the 
t = e string. On the string surface, the lines of constant a are chosen to be orthogonal to 
the lines of constant f. 



everywhere orthogonal to the strings (that is, orthogonal to the lines of constant f). This 
construction can be done both for open and for closed strings. For closed strings the cr 
range [0, cr c ] of the t — 0 string becomes the range of all other strings. For open strings the 
a range [0, o\ ] of the t — 0 string also becomes the range of all other strings. This happens 
because the boundaries of the string surface are orthogonal to the strings, and, as a result, 
the boundaries are lines of constant cr. 

In summary, the cr parameterization of a given string can be used to construct lines of 
constant a that are always perpendicular to the lines of constant t. In this parameterization 
of the string surface, the tangent dX/da to the strings and the tangent dX/dt to the lines 
of constant cr are perpendicular to each other at any point: 









dx 


dx 






— =0. 


(7.1) 


da 


dt 



Since the velocity dX/dt is perpendicular to the string, it coincides with v±_ (see (6.82)): 




at all points. 



(7.2) 
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and not only at the endpoints, where it happens independently of parameterization. Recall- 
ing that is the length parameter along the string, equation (7.1) implies that ■ ^ = 0, 
which allows us to simplify our earlier results for V TI ‘ and 'P 0// . We find that (6.93) 
becomes 

I-- - 5 - 4 =^- 03) 

Similarly, (6.91) becomes 



V alM = -T 0 




dXV 

~dT' 



(7.4) 



Equation (7.4) holds at the open string endpoints regardless of a -parameterization (see 
(6.96)); with the present parameterization it holds all over the string. 



7.2 Physical interpretation of the string equation of motion 



Having used the a parameterization to simplify some of our previous expressions, we now 
look into the string equations of motion (6.53). With t — r we have 

dV xtl dV a ^ 

= ■ (7.5) 

dt do 



Let us first consider the /r , = 0 component of this equation. It follows from (7.4) that 
V n(} = 0. Furthermore, equation (7.3) gives 



•pro 




Hack into the equation of motion (7.5), we obtain 



dV r0 

dt 



d 

dt 



r r ds 
da 



= o. 



-?) 



(7.6) 



(7.7) 



To understand this result physically, consider a little piece of string associated with a do 
that is a small fixed number. The motion of this particular piece of string is well defined 
now that we have fixed the lines of constant o. Since ds denotes the length of the do piece 
of string, ds can depend on time. If we multiply equation (7.7) by the constant do, we 
conclude that 



To ds 




(7.8) 
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is constant in time. Expression (7.8) has units of energy, which suggests that it may be the 
relativistic energy associated with this piece of the string. Indeed, this is in agreement with 
our findings in Section 6.7 where we saw that the rest energy of a static stretched string is 
given by its length times the tension Tq. The rest energy in the above expression is Tods, 
and the relativistic factor in the denominator makes (7.8) the total energy. Equation (7.7) 
therefore states that the energy in each piece da of the string is conserved. This is a very 
interesting fact. It implies, for example, that the energy in the range [0, ero] of an open 
string is constant in time for any fixed op- 

The above interpretation is also confirmed by a calculation of the energy £ of a 
relativistic string (Problem 6.9). You found that the Hamiltonian is 



H = 



/ 



Tods 




(7.9) 



which shows that (7.8) is the energy in a piece of string. 



Now we turn to the space components of the string equation of motion. The space 
components V x of V T 11 can be read from (7.3): 

ds 

— Vj_, (7.10) 



and similarly, from (7.4), 

/ 9X 

p (7A1) 

Now we can substitute these expressions back into (7.5) to find that 




9 

da 



[To 



v \ 9Y-| 

c 2 ds J 



_9_r7o 

dt L c 2 




iti dS q “* 

To dv± 



dt 



1 - 
1 0 



(7.12) 



where the final step used equation (7.7). It is possible to interpret this equation loosely 
in terms of an “effective” nonrelativistic string. Recall that the equations of motion for a 
classical nonrelativistic string are 



3 2 y _ 3 2 y _ 3 3 y- 

dt 2 - T ° dx 2 - dxl T ° dx\’ 



(7.13) 



where x is a length parameter along the direction defined by the static stretched string, and 
y is the transverse displacement. How do we recast (7.12) to resemble (7.13)? We use the 



134 



String parameterization and classical motion 



ds /da factor on the right-hand side to transform the cr -derivative on the left-hand side into 
an s -derivative: 



For small oscillations the length parameter s along the vibrating string is roughly equal to 
the parameter x along the direction of the static string. We can then compare this equa- 
tion with (7.13) and conclude that the relativistic string has a velocity-dependent effective 
tension 7^ and a velocity-dependent effective mass density /x e ff given by 



Since free open string endpoints move with i>j_ = c, the effective tension of the string goes 
to zero at the endpoints. One could say that the endpoints have to move at the speed of light 
in order to make the tension vanish at the endpoints. This is the only way that the relativistic 
string can have a tension and still make sense with free open ends. The effective mass 
density diverges at the endpoints. This is not a problem; the same divergence is present for 
the energy density which appears as the integrand in (7.9). Despite the singular behavior at 
the endpoints, the integral turns out to be finite, as required by consistency since, after all, 
we are describing strings with finite energy. 



Equation (7.14) is still fairly complicated. It may seem that, having fixed the lines of con- 
stant cr, we have run out of reparameterizations that can help us simplify the equations of 
motion. This is not the case, however. We showed how to construct the lines of constant 
a if we have one parameterized string. We must now try to parameterize this first string in 
the best possible way! 

Here is the physical way to do so: we will parameterize the string so that each string seg- 
ment of equal parameter length a carries the same amount of energy. We will parameterize 
the string using the energy! This parameterization will yield simple equations of motion. To 
see this, we first rewrite (7.14) suggestively by changing j-derivatives into a -derivatives: 





(7.15) 



7.3 Wave equation and constraints 




(7.16) 
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Let A(a) denote the ratio that appears prominently in the above equation: 



A(<r) = 



ds 

do 



1 



(7.17) 



We already showed in (7.7) that A(er) is independent of time. We will now choose a in 
such a way that A = 1. If we do so, the equation of motion (7.16) becomes the familiar 
wave equation: 



1 d 2 X _ d 2 X 
c 2 dt 2 da 2 



(7.18) 



This is a great simplification. To find a a that leads to A — 1 we assign a — 0 to one 
endpoint of the open string and work our way along the string assigning to each piece ds 
of string the interval da given by 



da 





(7.19) 



The first equality implies A = 1 , and the second equality follows from the identification of 
(7.8) with the energy dE carried by the little piece of string. In this parameterization the 
energy density d E Ida is a constant equal to the tension. Equation (7.19) can be integrated 
from the a = 0 endpoint up to a point Q, giving 

E(Q ) 

o(Q)= (7.20) 

The coordinate a{Q) assigned to Q equals the energy E(Q) carried by the portion of the 
string stretching from the selected endpoint up to Q, divided by the tension. It also follows 
from the above equation that 



E 




a e [0, <71 ], cr 1 = — , 

f 0 


(7.21) 



where E is the total energy of the string. 

Our choice of a, done for all strings, is consistent with the orthogonality condition (7.1). 
In fact, the lines of constant a have not been changed, only the values of a assigned to them 
have. Our constant a lines still ensure that the energy in the [0, a] portion of the strings is 
constant and such lines are orthogonal to the strings. 

The parameterization condition (7.19) is actually equivalent to a differential constraint 
on the coordinates X. We first rewrite the first equality in (7.19) as 

/ ds \ 2 1 9 

= (7 ' 22) 
Recalling that dX/ds is a unit vector, and making use of (7.2), we find 



/9X\2 1 /dX\2 

' da ) c 2 ^ dt ) 



= 1 . 



(7.23) 
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Finally, let us examine the boundary conditions. From (7.1 1) we have 



/ v 2 , da dX dX 



(7.24) 



Therefore, the free endpoint boundary condition is very simple: 



— =0 at the endpoints. (7.25) 

da 

With the chosen parameterization, the free endpoint boundary condition, first introduced 
in (6.56), is simply a Neumann boundary condition. 



All in all we have four equations to solve in order to find the motion of a relativistic string: 
(7.18), (7.1), (7.23), and (7.25). We collect them here: 



wave equation : 


d 2 X 1 d 2 X 


= 0, 


(7.26) 


da 2 


c 2 dt 2 


parameterization condition : 


dX 

~di 






(7.27) 


parameterization condition : 


/dX\2 1 /3Z\2 

w) + A-w) =1 ’ 


(7.28) 


boundary condition : 


dX 

da 


dX 

a = 0 3 a 


= 0. 

0=CT\ 


(7.29) 



For a string with energy E, the above equations require a\ = E/Tq. For the record, we 
also include (7.19), written as 



1 dE 
To da 




Finally, from equations (7.3) and (7.4) we get 



•pt/i _ To 

c 2 

V a/l = -T 0 



3 X' 1 
~dt ~ ’ 
dX» 
da 



(7.30) 



(7.31) 

(7.32) 



7.4 General motion of an open string 



In this section our goal is to describe the general motion of open strings with free boundary 
conditions. We will therefore examine in detail how to solve equations (7.26)-(7.29). 
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Let us first consider the wave equation for X. This equation is readily solved in terms of 
arbitrary vector functions of (ct ± a). We thus write 



X(t, a) = -(F(ct + a) + G(ct — er)). 



The boundary condition at the a = 0 endpoint demands that 

= 0 — > F'(ct) - G'(cf) = 0, 



dX 

da 



< 7=0 



(7.33) 



(7.34) 



where prime denotes derivative with respect to the argument. Since ct takes all possible 
values, the above equation holds for all values of the argument. Calling u the argument, we 
write 

dF(u) dG(u) 



G(u) = F(u) + ao, 



du du 

where a o is a constant vector. Back in (7.33) we now have 

1 



X(t, a) = F(ct + a) + F(ct - a) + SqJ- 



(7.35) 



(7.36) 



We can absorb the constant vector So into the definition of F (call F(u) + a q/ 2 the new 
F), and therefore our solution so far takes the form 



X(t, a) = ~^F(ct + a) + F(ct — er)^. 
Consider now the boundary condition at a = <7| : 



dX 

da 



= 0 



F'(ct + a\) — F'{ct — <7i) = 0. 



Letting u — ct — a\, the above condition becomes 

clF dF 

— (u + 2a i) = — (w). 
du du 



(7.37) 



(7.38) 



(7.39) 



This equation tells us that the derivative of F is periodic with period 2o\ . Integrating, we 
find that the function F is quasi-periodic: after a period 2a\ it changes by a fixed constant. 
We write this as 



F(u + 2a\) = F(u) + 2a\ — , 

c 



(7.40) 



where no is a vector constant of integration with units of velocity, and the constants have 
been added for convenience. This concludes our analysis of the boundary conditions. 



We now examine what restrictions the parameterization conditions (7.27) and (7.28) 
impose on the function F. A standard trick is to add and subtract the first equation to 
the second, as in 



/d)t\2 ax l dx l /dX\2 

V 3 a) da c dt + A dt ) 



(7.41) 



which can then be written more briefly as 
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/ dX IdX 
y da c dt 



= 1 . 



(7.42) 



Note that this is equivalent to the two constraints (7.27) and (7.28). Using (7.37) we can 
evaluate the derivatives that enter into the constraints: 

^ F'(ct+a ) - F\ct - cr)), 

i B x i . . ( 7 - 43 ) 

= ~(F'(ct+a) + F' (ct - a )). 
c dt 2 

As a result, 

3X 1 dX 

— ±-— = ±F\ct±a). (7.44) 

da c dt 

It follows that the constraints (7.42) require F' ■ F' = 1 for all arguments of F . In other 
words, the vector F'(u) is a unit vector: 



dF(u) 2 
du 



(7.45) 



This is good progress: the constraints give a simple condition for F(u). Since F is a vector 
function of the parameter u, we can visualize F(u) as a parameterized curve in space. 
Equation (7.45) has a simple interpretation: 



u is a length parameter along the curve F (u). (7.46) 



This is explained as follows. Consider two nearby points F(u + du) and F(u) on the curve. 
Their vector separation dF — F(u + du) — F(u) has length d F \ . Equation (7.45) implies 
that \dF\ = \du\, showing that the parameter change \du\ is the distance between the two 
nearby points. 

We can now summarize our analysis of the equations of motion. The general solution which 
describes the motion of an open string with free endpoints is given by 



1 /- 

X(t, a) = — yE(cf + a) + F(ct 



4 



a e [0, cr i ] , 



(7.47) 



where eri — E/Tq, E is the energy of the string, and F satisfies the conditions 
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The vector function F(u) changes by a constant vector (2 cr^v 0 /c) when u u+ 2<r-|. The 
parameterized curve F(u) encodes the full motion of an open string with free endpoints. 



dF(u) 2 
du 



- - Vn 

and F(u + 2o\) = F(u) + 2cri - . 

c 



(7.48) 



The problem has been reduced to that of finding a vector function F which satisfies 
equations (7.48). The second of these equations tells us that it suffices to find F{u) for 
u e [0, 2cri]. This determines F(u) for all u, and thus it determines X(t,a) completely. 
The interpretation of no will be given below. An illustration of F is shown in Figure 7.2. 



We can give a physical interpretation to F(u). It follows from (7.47) that the motion of the 
a — 0 endpoint of the open string is described by 

X(t, 0) = F(ct). (7.49) 

Therefore, we see that: 



F(u) is the position of the a = 0 endpoint at time u/c. (7.50) 

Additionally, we can give a physical interpretation to the constant velocity vq. From the 
second equation in (7.48) we have 

F(2a0 = F(0) + 2a 1 ^°, (7.51) 

c 

and expressing the F in terms of the position of the a = 0 endpoint, we find 

= o) = Z(r = 0,0) + (^)uo. 



(7.52) 
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This shows that vq is the average velocity of the a — 0 endpoint during the time interval 

[0,2 m/c]. 

Quick calculation 7.1 Show that vq is, in fact, the average velocity of any point a on the 
string calculated over any time interval of duration 2eri /c. 

Quick calculation 7.2 Show that the velocity of any point on the string is a periodic 
function of time with period 2 ctj/c, namely, X(t , o) = X{t + , a). 

Since F can be reconstructed by looking only at the a = 0 endpoint, we may ask: how long 
do we need to observe this endpoint in order to determine the full motion, past and future, 
of an open string with energy El Since the motion is determined if we know E(u) from 
u — 0 to u = 2cti, we must observe X(t, 0) from t = 0 to t = 2o\/c. Since esq = E/Tq, 
we need to observe the endpoint for a time interval At = 2 E /cTq. This is twice the time 
that light takes to travel a length E / 7o, the length of a static string of energy E. 



We now use the above construction to describe the motion of a straight open string with 
energy E which rotates rigidly about its fixed midpoint in the (x, y) plane (Figure 7.3). 
Our first goal is to produce the function F{u). This function is easily constructed because 
we know all about the motion of the endpoints. Assuming that the string is of length l and 
rotates with angular frequency a>, we describe the motion of the a — 0 endpoint by 



X(t, 0) 



1 

2 



(cos cot, sinwr), 



(7.53) 



where we use vector notation with two components, since the motion is restricted to the 
(x, y) plane. Given that F(ct) = X(t, 0) we have 
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(7.54) 



The function F is periodic, suggesting that the vector vq in (7.48) must vanish. More 
precisely, vq vanishes because it is the average velocity during the period in which the 
velocity repeats, the velocity repeats after precisely one full turn, and the average velocity 
of any point during one full turn is zero. With = 0 equation (7.48) imposes the condition 
F(u + 2(Ji ) = F(u). Using (7.54), this gives 

co co n 

— (2o\) = 2 jt m — > — = — m, (7.55) 

C C CJ[ 



where m is an integer. It is simple to see that we must choose m = 1. For this we just 
calculate X (0, a), which gives the string at time equal to zero: 



i(0, a) 




F(—a )) = 




Tcma 







(7.56) 



For m = 1, the string is recovered when a e [0, crj]. For arbitrary m, the function X((). a) 
traces out the string m times when a e [0, cri ] . Choosing m — I . we find 



co it tcTq 
c a i E 



(7.57) 



This gives the angular frequency of the motion in terms of the energy. The first condition 
in (7.48) determines the length t. Indeed, 



and, as a result, 

dF 2 
du 

This length is smaller, by 
This is sensible since this 

E=\ T{)1, (7.60) 

which states that the energy of a rotating string is a factor of 7r/2 bigger than the energy of 
a static string with the same length. Note also that co{l/2) = c, telling us that the endpoints 
move with the speed of light. In this solution the energy is proportional to the length of the 
string. Perhaps surprisingly, as the energy of the string increases, the angular velocity co 
decreases. This happens because the string endpoints have to move at the speed of light, so 
as the length of the string grows the angular velocity has to decrease. 

Having determined co and l, we really know the motion of the string. It is of inter- 
est, however, to write the complete expression for the motion of the parameterized string 
X(t, a). In terms of oq, the vector F in (7.54) is now given by 

F{u) — — ( cos — , sin — ) . 

7T V (7 1 (7i / 
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7T To' 



(7.58) 



(7.59) 



a factor of 2/n, than the length of a static string with energy E. 
string has kinetic energy. Alternatively, 



(7.61) 
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Finally, using (7.47) we obtain 



- (71 

X(t, ct) = — ( cos 
2 jt 



n (ct + ct) 
CTl 



n(ct — ct) n(ct + o) 
cos , sin 



sin - 



n(ct — er) 



o-i 



o' l 



o-i 



( 7 . 62 ) 



and after simplification. 



- cri jtct ( net nct\ 

X(t, ct) = — cos ( cos , sin 1 . 

n CT] \ ( 7 ) (71 / 



( 7 . 63 ) 



The parameterized string is of interest because the energy density is a constant function of 
the parameter ct. In Problem 7.2 you will calculate the energy £ (s) per unit length on the 
string as a function of the distance s' to the center: 



£{s) = 



To 






( 7 . 64 ) 



At the center of the string the energy density £(()) coincides with Tq. This had to be so, 
because the string does not move at the center. The energy density diverges at the string 
endpoints s = ± lj2 . The total energy, however, is finite. 



7.5 Motion of closed strings and cusps 



Let us now consider the general motion of a free closed string. Just like for open strings, 
both the wave equation (7.26) and the parameterization constraints (7.27) and (7.28) apply. 
The wave equation is solved by the expression 

X(t,o) = ^(F(u) + G(v)], ( 7 . 65 ) 

where we have introduced variables u and v defined by 

u = ct T < 7 , 

( 7 . 66 ) 

V = Ct — ( 7 . 



Taking derivatives of X we immediately find 



1 dX 1,-, . 

c dt 2 y ’ 

dX 1,-., , 

- = -(F' ( u)-G'Cv)), 



( 7 . 67 ) 



( 7 . 68 ) 



where primes denote derivatives with respect to argument. We can now form the linear 
combinations 



dX 1 dX 
do ^ c dt 



F'(u) and 



dx l dx 

do c dt 



-G\v). 



( 7 . 69 ) 
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The tips of the unit vectors F'(u) and G'(v) define two closed parameterized curves that, 
in this case, intersect for u =u 0 and v =v 0 . 



The parameterization constraints, summarized in (7.42), then give 

|f'(m)| 2 = |G'(u)| 2 = 1, for all u, v. (7.70) 

For closed strings we do not have boundary conditions but rather a periodicity condi- 
tion. Since da — dE/To on account of (7.30), the closed string parameter a has the 
identification 

ct~ct + cti, where ai — E/To , (7.71) 

and E is the energy of the string. As a increases by eri we are back to the same point on 
the closed string and therefore 





X(t, a + eri) = X(t, a) 


(7.72) 


is the periodicity condition. Making use of (7.65), this condition gives 






F(u + a i) + G(v - cr i ) = F(u) + G(v ), 


(7.73) 


or, equivalently, 


F(u + <7i) — F(u) — G(v ) — G(v — eri). 


(7.74) 



The functions F(u) and G(v) need not be periodic with period a \ , but they must change by 
the same vector when their arguments are increased by o\ . Since u and v are independent 
variables, partial derivatives of (7.74) with respect to u and v give 

F’(u + eri ) = F\u) and G\v + a x ) = G'(v). (7.75) 

Equations (7.70) and (7.75) imply that F\u) and C (v) are periodic unit vectors; they can 
be described as two independent parameterized closed curves on the surface of a unit two- 
sphere (Figure 7.4). A closed string motion is fully specified, up to a constant translation, 
by these two parameterized curves. Indeed, the curves fix F'(u) and G' (v) and, by inte- 
gration, F(u ) and G(v) up to integration constants, resulting in X(t, a) fixed up to the 
addition of a constant vector. 
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An interesting situation arises quite generically. As shown in Figure 7.4, the two param- 
eterized curves F' (n) and G'(v) may intersect for some values no and vq of the parameters 
u and v: 

F'(u 0 ) = G'(v 0 ). (7.76) 



Let to and ctq be the values of t and a defined by no and i>o through (7.66). It now follows 
from (7.67) that 

1 dX 1 

t (to, cr 0 ) = -(F'(no) + G\v o)) = F\uo). (7.77) 

c at 2 

Since F’ is a unit vector we learn that at t = to the point o = go on the string reaches the 
speed of light! Moreover, the motion of the point is in the direction of F'(uq). Additional 
information follows from (7.68): 



9 X 1 

— {to, do) = ~{F'(uo) - G'(i>o)) = 0. 



(7.78) 



This means that the parameterization of the t — to string becomes singular at a — op. To 
examine the shape of the string near a = cro we fix t = to and use a Taylor expansion 
around a = op: 



3X 



1 



,d z X 



X(t 0 , a) = X(t 0 , cro) + (cr - cr 0 ) — (t 0 , cr 0 ) + -(a - cr 0 ) — -y(fo, cr 0 ) 



1 

+ rrM - cr 0 y 



,d 3 X 

3U~ 3^3 

Using (7.78) and the definitions 



3 a 



(to, cr 0 ) 



da 2 



- d 2 X - d 3 X 

T = -7^2 (tO, cro), R = (to, a o), 



Xo = X(to, (to), 

we find that the expansion (7.79) loses the term linear in a — op and becomes 



X{to, cr) = Z 0 + ^(ct - cr 0 ) 2 f + ~ (cr - ctq) 3 R H • 



(7.79) 



(7.80) 



(7.81) 



For generic situations T and R are nonvanishing and nonparallel. It then follows that for 
a = ao we have a cusp. A cusp on a string is a point where the two outgoing string direc- 
tions form zero angle. Equivalently, at a cusp the oriented tangent to the string reverses 
direction. Equation (7.81) describes a cusp at Xo'. as a grows from values right below cro 
to values right above cro the string approaches Xo along T and recedes from Xo along T . 
Indeed, near Xo, a — ao is very small and the term that contains T dominates the expan- 
sion (7.81). As we move away from Xo the cusp opens up due to the term that contains 
R . If we choose a coordinate system in which the tip of the cusp is at the origin, T points 
along the positive y axis, and R lies on the (x, y) plane, the cusp traces the curve y ~ x 2,/1> , 
as you will show in Problem 7.7. 

If a cusp occurs for (no, vq) it will also occur for (no + ma\, vq + no\), with m and n 
arbitrary integers. As a result, as time goes by, cusps will appear and disappear periodically 
at various points on the string. Given any two parameterized paths F'{u) and G' (v) on the 
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two-sphere, there may be several different intersection points. Each will give rise to a set 
of cusps. 



Our analysis of the classical motion of relativistic strings may be applied to the study of 
cosmic strings. It is conceivable that physics in the early universe results in strings that 
are not microscopic but rather expand with the universe to become very large. The motion 
of cosmic size strings can be study in the classical approximation. As of 2007 no cosmic 
string has been detected. The discovery of a cosmic string would be a remarkable event. As 
it turns out, cosmic strings may arise from phenomena unrelated to string theory so, even 
if discovered, much work will be needed to tell if a cosmic string is a string theory string. 

The most direct way to detect a cosmic string is through gravitational lensing. To under- 
stand this we begin by discussing the gravitational effects of a straight, infinitely long 
relativistic string. Suppose you place a massive particle some distance away from such a 
string. Since the string has rest energy one may imagine that the particle would experi- 
ence gravitational attraction. This is not the case; the particle would experience exactly 
zero force. This is a result in general relativity and would not hold in Newtonian gravita- 
tion where only the effective mass density jio of the string contributes to the gravitational 
attraction. In general relativity the tension of the string gives an additional contribution, 
in fact, a gravitational repulsion. The total attractive force is proportional to (/xo — 5r), a 
combination that precisely vanishes for relativistic strings. 

Although a string does not exert gravitational attraction it affects the geometry of the 
planes orthogonal to the string. Suppose you go around the string keeping your distance 
r to the string constant. The circumference C of the traced circle would be less than the 
expected value of 27rr. More precisely, for any radial distance r, 



where the constant A is called the deficit angle. The two-dimensional spaces orthogonal 
to the string are in fact cones with deficit angle A. The string runs along the apexes of the 
cones. Indeed cones are spaces in which circular loops at constant distance from the apex 
satisfy (7.82). This can be made manifest in the construction of a cone starting from the 
plane. If we represent the plane using the complex variable z = x + iy (as we did when 
studying orbifolds in Section 2.8) the cone arises by cutting out the region 0 < arg(z) < A 
and identifying the resulting boundaries via z ~ e‘ A z. This is shown in Figure 7.5, where 
we also show a circle with constant distance to the apex z = 0. In our discussion of lensing 
we will assume, for simplicity, that the observer O and the light source S lie on the same 
cone, namely, they share the same value of the coordinate along the string. 

The deficit angle A produced by the relativistic string depends on the tension of the 
string and the value of Newton’s constant G. A calculation in general relativity shows that 
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(7.82) 



r 



SjvGTq _ SttG/xq 
c 4 c 2 



(7.83) 
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A 





Left: the complex z plane with the set of points whose argument is greater than zero and 
less than A removed. Right: the familiar picture of a cone obtained by identifying the lines 
AM and AM' on the left. 



The angle A, as we will see soon, is closely related to lensing angles, whose typical values 
are measured in arc seconds. Noting that 1" = 4.85 x I 0~ 6 rad, we can write 

a-j.»-x(^£). <784) 

Quick calculation 7.3 Use the values of G and c to show that 

A = 3.85" x ( — V (7.85) 

V 10 21 kg/m/ 

Six kilometers of a string with A = 3.85" would pack a mass equal to that of Earth. 



Further insight into the value of A can be obtained by writing (7.83) as a manifestly 
unit-less ratio of the Planck mass m p and a string mass m s , defined as the unique mass that 
can be written using only powers of fz q, c, and h. 



Quick calculation 7.4 Show that 




Recalling that mp = *Jhc / G we find 

m s _ jGfi o 

mp V c 2 

and, as a result. 




(7.86) 



(7.87) 



(7.88) 



A small A arises for m s small compared to mp. 

To discuss lensing we first review some properties of geodesics. A curve that joins two 
fixed points is a geodesic if its length is stationary under arbitrary infinitesimal defor- 
mations that vanish at the fixed points; the length does not change to first order in such 
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deformations of the curve. In the plane there is just one geodesic between any two points: 
the straight line that joins them. This geodesic is also the shortest path between the two 
points. On more complicated spaces geodesics need not be unique nor have minimum 
length. There are infinitely many geodesics joining the north pole and the south pole of a 
two-sphere, each one a half-circle of constant longitude. For two generic points on the two- 
sphere you get two geodesics, a short one and a long one. Indeed, the two points determine 
a great circle, and the geodesics are the two complementary arcs bounded by the points. 
The short geodesic is the shortest path between the two points. The long geodesic is only a 
path of stationary length, a saddle point for the length function: some deformations make it 
longer and others make it shorter. In fact, the long geodesic can be continuously deformed 
into the short geodesic. On a cylinder any two points can be joined by an infinite number 
of geodesics. The geodesics are characterized by the number of times they wrap around 
the cylinder while getting from one point to the other. 

Quick calculation 7.5 Sketch and find the length of the five shortest geodesics that join 
the points (0, 0) and (1,0) on the cylinder (x, y) ~ (x, y + 1). 

On a cone there is an intricate pattern of geodesics. Consider two points P and Q and the 
geodesics from P to Q. The number of geodesics depends on the deficit angle A of the 
cone at the apex A and the angle 0 between P and Q, defined as the positive angle that by 
clockwise rotation takes AP to AQ. The angle 0 must necessarily be smaller than the total 
angle a at the apex: 

0 < a = 2 jt — A. (7.89) 

For a complete calculation of the geodesics see Problem 7.8. Here we restrict ourselves 
to the case A <5jC 1, relevant to possible astrophysical situations. Assume that the observer 
O is a distance do from the string and that the light source S is a distance d$ from the 
string. Lensing occurs if the source S is roughly opposite to O across the string. In this 
case there are two geodesics that join S and O , one to each side of the string. Light uses 
both geodesics to reach O , who detects two identical images of the source S. Two identical 
images is a signature of lensing by strings and occurs because cones have no curvature 
away from the apex. Lensing by compact objects can produce more than two images and, 
since the relevant geometry is curved, the images are distorted and can be quite different. 

To visualize the geodesics we represent the cone by placing the source on the two radial 
lines that are to be identified across an angle A (Figure 7.6). The source appears as the 
points S and S', and the two geodesics that join the source and the observer are the straight 
lines S O and S' O . The angle 50 between S O and S' O at O is the lensing angle - the angle 
between the images seen by the observer. From the figure we see that 

50 = a + P- (7.90) 

Moreover, since A is the sum of exterior angles for the triangles SAO and S' AO, it equals 



(7.91) 
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A source S and an obserber 0 at distances d s and d 0 , respectively, from the apex A of a cone 
with deficit angle A. For simplicity, the cut is along the source, which appears as points S 
and S' to be identified. Rays SO and S'O from the source reach the observer in directions 
separated by the lensing angle &<j> = a + /?. 



Since each angle that appears on the above right-hand side is positive and A <$C 1, each one 
of them is small: a, a' , /J, /T 1. The law of sines then gives 

sin a sin a' sin B sin B' a a' B B' 

= , — - = — — — = — , — = — , ( 7 . 92 ) 

ds do ds do ds do ds do 

using the small angle approximation. Solving for a' and />' and substituting back in (7.91), 
we have 

A= a + p+ d ^(a + P) = S<p(l+ d ^), ( 7 . 93 ) 

ds \ d S ' 

where we also used (7.90). We finally get 



S(p = 



0 + £)' 



( 7 . 94 ) 



We see that the lensing angle S(f> is bounded above by A. It approaches this upper bound as 
ds oo. Figure 7.6 also makes clear that lensing occurs only if O lies within the sector 
MAM' . If O lies outside this sector, there is just one geodesic and O sees a single image. 



In the past few years images of galaxies that were candidates for string lensing were later 
shown, in higher resolution, to be the images of two similar but different galaxies. Since 
cosmic strings possibly move with relativistic speeds, it may be that lenses exist only for 
limited periods of time, making their detection challenging. 



It may be possible to infer the existence of cosmic strings indirectly. Cosmic strings can 
give contributions to temperature anisotropies in the cosmic microwave background. The 
lack of observed evidence to this effect suggests the bound 
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Equation (7.84) shows that for strings obeying this bound, tensing angles will be smaller 
than a couple of arc-seconds. Additionally, the motion of cosmic strings produces gravita- 
tional waves. In fact, the cusps we studied in the previous section are efficient generators of 
gravity waves. It seems possible that such waves could be observed by gravitational wave 
detectors even for strings of G/r-o/c 2 ~ 10 -13 . While it remains a long shot, cosmic strings 
may furnish the first experimental evidence for string theory. 



Problem 7.1 Short proofs concerning open strings with completely free endpoints. 

Prove the following four statements that refer to open strings with free endpoints. 

(a) If one endpoint of an open string happens to lie at all times on a hyperplane, the full 
open string lies at all times on the same hyperplane. 

(b) If one endpoint of an open string happens to lie at all times within a distance R from a 
point P 0 , the full open string lies at all times within a distance R from Pq. 

(c) If one endpoint of an open string happens to lie at all times within a convex subspace, 
the full open string lies at all times within that convex subspace. [This is the general 
version of the results proven in (a) and (b).] 

(d) Show that the length i of an open string parameterized with energy (Section 7.3) is 
given by 



Problem 7.2 Exploring further the rigidly rotating string. 

Let s e {—1/2, 1/2) be a length parameter on the rigidly rotating string studied in Sec- 
tion 7.4, with .v = 0 chosen to be the fixed center of the string. Let £{s) denote the energy 

per unit length as a function of 5. 

(a) Show that £{s) = Tq/^ 1 — (As 2 /l 2 ). Plot £(s) as a function of s. Note that £(s) has 
integrable singularities at the string endpoints, and confirm that the total energy is 
%ITq. 

(b) For what points on the string is the local energy density equal to the average energy 
density? 

(c) Calculate the energy E(s) carried by the string on the interval [—5, ,v]. What is the 
value of s/{t/ 2) for this energy to be half of the total energy of the string? 90% of the 
energy? 

Problem 7.3 Time evolution of an initially static closed relativistic string. 



The time development of closed strings in the static gauge is governed by equations (7.26), 
(7.27), and (7.28). 



Problems 
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Problem 7.4: Kasey's relativistic jumping rope. 



(a) Assume that (0, cr) — 0, and write the general solution for X (t, a) in terms of a 
vector function F(u) of a single variable. What do the parameterization conditions 
require onf? 

(b) On a closed string the parameter a lives on the circle a ~ a + cr\. What condition 
would you impose on X(t, cr) to implement this feature? What are the implications 
for FI 

(c) Consider a closed string which at t — 0 is static and traces a closed curve y of length l. 
Calculate the smallest time fp > 0 that must elapse for the closed string to trace the 
curve y again. Is X(/p, a ) equal to X(0, cr)? 

(d) List the steps you would take with a computer (that can do integrals and invert func- 
tions) to produce the time evolution of an initially static closed string of arbitrary 
shape lying on the ( x , y) plane. Assume that the initial string is given to you as the 
parameterized closed curve (x(X), y(X)), with some parameter X e [0, /.()]. 

Problem 7.4 Kasey’s relativistic jumping rope. 

Consider a relativistic open string with fixed endpoints: 

Z(t,0)=xi, X(t, cr i ) = X 2 - (1) 

The boundary condition at o = 0 is satisfied by the following solution to the wave 

equation: 

X(t, cr) = x\ + + cr) - F(ct - cr)^. (2) 

Here F is a vector function of a single variable. 

(a) Use (2) and the boundary condition at a — o\ to find a condition on /* (« ). 

(b) Write down the constraint on F(u) that arises from the parameterization conditions 
(7.42). 
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As an application, consider Kasey’s attempts to use a relativistic open string as a jumping 
rope. For this purpose, she holds the open string (in three spatial dimensions) with her right 
hand at the origin x\ = (0, 0, 0) and with her left hand at the point z = Lo on the z axis, or 
X 2 = (0, 0, Lo) (Figure 7.7). As she starts jumping we observe that the tangent vector X' 
to the string at the origin rotates with constant angular velocity around the z axis forming 
an angle y with it. 

(c) Use the above information to write an expression for F'(u). 

(d) Find a\ in terms of the length Lo and the angle y . 

(e) Calculate X(t, a ) for the motion of Kasey’s relativistic jumping rope. 

(f) How is the energy distributed in the string as a function of z? 



Problem 7.5 Planar motion for an open string with fixed endpoints. 



Consider the motion of a relativistic open string on the ( x , y) plane. The string end- 
points are attached to (x,y) = (0, 0) and (x, y) = (a, 0), where a > 0. As opposed to 
the relativistic jumping rope, the string now remains in the (x, y) plane. The motion is 
described by 

X(t,a) =]^{P(ct +a) - F(ct - a)^, (1) 

where F(u) is a vector function of a single variable which satisfies 



dF 2 
du 



= 1 



and F(u + 2o\) 



Consider an ansatz of the form 



F(u ) + (2a , 0). 



( 2 ) 



(a) 

(b) 

(c) 

(d) 

(e) 



dF / r 

F (u) = = ( cos 

du \ L 



y cos 



TCU 



O'! 



r 7TU1 \ 

sm^ycos — y 



(3) 



Is this ansatz consistent with the conditions in (2)? 

Calculate X\0 , er). Letting X(0, a) = (x(er), y(cr)), give dy /da and plot it as a func- 
tion of a e [0, cti], assuming, for convenience, that 0 < y < tz/2. Use this to make a 
rough sketch of the string position y(a) as a function of a at t = 0. 

Calculate X'(t, 0) and use it to describe the motion of the string near the origin. What 
is the interpretation of y ? 

Use the second condition in (2) to find an integral relation between a, a \ and y. 
Assume that y is small, and find an approximate explicit relation between these three 
variables keeping terms of order y 2 . 

Show that a/o\ = 7o(y ), where Jq is the Bessel function of order zero. [Hint: look up 
integral representations of Bessel functions. ] 



Problem 7.6 Planar motion ( continued ) and the formation of cusps. 

We investigate further the solution obtained in Problem 7.5, where it was shown that 
a/o\ — / 0 (y). Here a is the distance between the fixed endpoints of the open string, and 
ori is the length parameter of the string: it equals the length of the string at any time when 
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the string has zero velocity (why?). The relation between a and the angular variable y 
is unusual since Jq is not a periodic function. This means that the open string motions 
corresponding to y and 2 tt + y are not the same. 



(a) Show that the instantaneous slope of the string is described by 

/ 7 ret no\ 

X (t, a) = cosl y sin sin ) (cos /6, sin ft ) , 

V a i o\ / 

where 

net no 

P = y cos cos . 

<71 <71 



Show that at ct — o\ /2 the string is horizontal. 

(b) Prove that the instantaneous (transverse) velocity of the string satisfies 



1 dX 




/ net no\ 


c dt 


— 


sin( y sin sin — ) 

V o\ o\ / 



Note that at t = 0 the string has zero velocity. Conclude that whenever y < n/2 no 
point on the string ever reaches the speed of light. Moreover, show that for y = n/2 
the string midpoint <7 = o\/2 acquires the speed of light when the string is horizontal. 

(c) A tractable case is obtained for y = ~/2(n /2) (~ 127.3°). Show that at ct = o\/A one 
point on the string reaches the speed of light. Examine the string at the slightly later 
time ct — <7i /3, show that there are two points that have the speed of light, and find 
the corresponding values of a . Analyze X' as a function of o to show that the string 
has a cusp at each of these points. 

(d) Use your favorite mathematical software package to generate the picture of the string 
considered in (c) at various times (you will use numerical integration). Assume that 
a — 1 and verify that <7i ~ 10.155. Show the string for ct — 0, <7i/4, and <ti/3. 



Problem 7.7 Cusps in the evolution of closed strings. 

In this problem we derive a few properties of the cusps that appear generically in the 
evolution of free closed strings. For this we examine in more detail equation (7.81). 

(a) Use the Taylor expansions of F{u) and G(v) around wo and r>o to prove that 

T = { -{F'\u 0 ) + G"(v o)), R = ^(F w (h 0 ) - G"> 0 )), (D 

where primes denote derivatives with respect to the argument. Assume that the inter- 
section of the paths on the two-sphere indicated in equation (7.76) is regular: the paths 
are not parallel at the intersection and neither F"{uq) nor G”( vq) vanishes. Explain 
why T is non-zero and orthogonal to F'(uq). In general R does not vanish, but it may 
under special conditions. 

(b) Equation (7.81) shows that the cusp opens up along the direction of the vector T and, 
locally, is contained in the plane spanned by T and R. Fix the origin of the coordinate 
system at Xq, align the positive y axis along T, and position the x axis so that R lies 
on the ( x , y) plane. Demonstrate that near the cusp y ~ x 2 ^. In what plane does the 
velocity of the cusp lie? 



7.6 Cosmic strings 




Problem 7.8: a cone, shown shaded, with total internal angle a at the apex and two points 
P and Q separated by an angle 0. 

(c) Consider the functions F{u) and G(v) given by 



Verify that conditions (7.70) and (7.74) are satisfied. For the cusp at t — a = 0 give its 
direction, the plane it lies on, and its velocity. Draw a sketch. 

(d) Show that the motion of the closed string has period eri/(4c). (Remember, as you 
found in Problem 7.3, that the string returns to its original position in less time than 
the function F(ct + cr) takes to repeat itself.) How many cusps are formed during a 
period? 

Problem 7.8 Counting geodesics on a cone. 

Let A < 27T be the deficit angle of a cone and a = 2n — A the angle at the apex A. 
Consider two points P and Q on the cone separated by an angle <p > 0: the ray from the 
apex through Q is obtained from the ray from the apex through P by a clockwise rotation 
through an angle 0. The number N of geodesics joining the points P and Q is given by 



where [x] denotes the largest integer less than or equal to x. A convenient depiction of the 
conical region and copies of it is shown in Figure 7.8. 

(a) Convince yourself of the validity of (1) by counting the geodesics from P to Q and 
from P to the images of Q developing in the clockwise direction, as well as those from 
P to the images of Q developing in the counterclockwise direction. 

(b) Verify that N is invariant under </> — > a — 0 and explain why it should be so. 

(c) A < 7 r is the case relevant for gravitational lensing. What are the possible values of N 
as a function of 0? 




(2) 




( 1 ) 
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8 World-sheet currents 

L 



For physical insight, physicists often turn to ideas of symmetry and invariance. 
Symmetry properties of dynamical systems and conserved quantities are closely 
related. We will learn that in string theory there are currents that flow on the two- 
dimensional world-sheet traced out by the string in spacetime. The conserved 
charges associated with these currents are key quantities that characterize the 
free motion of strings. We give a simple physical interpretation of the objects 
V z and V a which we encountered earlier. 



8.1 Electric charge conservation 



We begin our study by reviewing the physics and the mathematics of charge conservation 
in the context of Maxwell theory. This classic example will help us to develop a more 
general understanding of the concept of conserved currents. 

In electromagnetism, the conserved current is the four-vector j a = ( cp , j ), where p 
is the electric charge density and j is the current density. Why do we say that j a is a 
conserved current? By definition, j a is a conserved current because it satisfies the equation 

3„;“ = 0. (8.1) 



Any four-vector which satisfies this equation is called a conserved current. The term “con- 
served current” is a little misleading, but it is a convention. More precisely, we should say 
that we have a conserved charge, because it is really the charge associated with the current 
that is conserved. Let us see how this conservation arises. 

When we separate the space and time indices in (8.1) we get 



3o;° + dtf = 



3 f 

dx° 



+ V • j = 0 . 



( 8 . 2 ) 



Why is this equation a statement of charge conservation? In electromagnetism, the total 
electric charge Q(t) in a fixed volume V is just the integral of the charge density p over 
the volume: 



Q(t ) 



= / p(t, x) d 3 x = I 

Jv Jv 



j°(t,X ) 



d 3 x. 



c 



(8.3) 
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Up to a constant, the charge is the integral over space of the first component of the current. 
Its time derivative is given by 



dQ 

dt 



L 



9j° 

dx° 



d?x. 



(8.4) 



Using equation (8.2), we can write 



dQ 

dt 



L 



V • j dr’x. 



(8.5) 



Letting S denote the boundary of V, the divergence theorem gives 



dQ 

dt 




( 8 . 6 ) 



This equation encodes the physical statement of charge conservation: the charge inside a 
volume V can only change if there is a flux of current across the surface S which bounds 
the volume. In many cases, we take V to be so large that the current j vanishes on the 
surface S. In these cases, 




(8.7) 



The charge Q is then time independent and is said to be “conserved.” It is also well estab- 
lished that electric charge is a Lorentz invariant: all inertial observers measuring charge 
obtain the same number. Not all conserved quantities are Lorentz invariant. Energy, for 
example, is conserved, but energy is not Lorentz invariant. These facts are examined in 
more detail in Problems 8.1 and 8.2. 



8.2 Conserved charges from Lagrangian symmetries 



One of the most useful properties of Lagrangians is that they can be used to deduce the exis- 
tence of conserved quantities. Conserved quantities can help us to understand the dynamics 
of a system. In this section we begin our work in the context of Lagrangian mechanics, 
learning how to construct the conserved quantity associated with a symmetry. We then turn 
to Lagrangian densities, and show how to construct the conserved current associated with 
a symmetry. 

Let L(q(t), q(t): t ) be a Lagrangian that depends on a coordinate q(t), the velocity q(t), 
and may even have explicit time dependence. Consider, moreover, a variation of the 
coordinate q{t): 

q(t) -* q(t) + Sq(t), (8.8) 

where Sq (t) is some specific infinitesimal variation. If q (t ) represents the path of a particle, 
for example, the variation above is an instruction on how to change the path: the position 
at time t is changed by Sq(t). Suppose that we have a rule that tells us how to vary any 
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path q(t). This means that given any q(t), we know how to construct the corresponding 
variation 8q{t). Such a rule can be written in the form 

8q(t) = e h(q(t); t), (8.9) 



where e is an infinitesimal constant and h is some function. 

As a result of the change (8.8) of the path, there is an induced change in the velocity q(t): 



q(t ) -* q(t) + 



d(8q(t)) 

dt 



( 8 . 10 ) 



In order to determine how L(q(t ), q(t)\ ?) changes as a result of the variation (8.8), we 
must also vary the velocity q(t) according to (8.10). We will generally speak of the varia- 
tion of q (?) leaving it implicit that the velocity q (?) is also varied accordingly. Because 8q 
is infinitesimal, the variation of the Lagrangian consists only of terms which are linear in 
Sq. If those terms vanish, the Lagrangian is said to be invariant. Moreover, the transforma- 
tion in (8.8) is then said to be a symmetry transformation. A rule that tells us how to vary 
any path in such a way that the Lagrangian is not changed is a symmetry transformation. 
The rule is specified by the function h in (8.9). 

We now state our claim: if the Lagrangian L is invariant under the variation (8.8), then 
the quantity Q, defined by 



3 L 




eQ = —8q, 
dq 


(8.11) 



is conserved in time for physical motion. That is, for any motion q(t) which satisfies the 
equations of motion, the “charge” Q is a constant: 



d(2 

dt 



= 0 . 



(8.12) 



Note that the e on the left-hand side of (8.11) cancels with the e that appears in 8q 
(see (8.9)). 

To prove the conservation of Q, consider the Euler-Lagrange equations that follow from 
the variation of the action S = f dt L (you may have derived them in Problem 4.8). These 
equations are: 



d /3L\ 3 L 

dt\dq) 3 q 



(8.13) 



Since the Lagrangian does not change under the coordinate and velocity variations (8.8) 
and (8.10), we must have 



3 L 
dq 



8q + 



3 L d 
dq dt 



(.Sq) = 0. 



(8.14) 
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Using (8.13) to eliminate P 1 we obtain 



d 

dt 





proving that (8.12) holds for Q defined as in (8.1 1). 



(8.15) 



Let us apply this new perspective on conservation to a Lagrangian L(q{t)) that depends 
only on the velocity. How can we vary q{t) leaving the Lagrangian unchanged? One way 
is to apply q(t ) — » q(t) + e, where e is any constant. In this variation, Sq(t) = e, and the 
function h of (8.9) is just equal to one. This is a uniform space translation: at any time t 
the position of the particle is changed by the same amount e. Correspondingly, the velocity 
does not change: q{t) q(t) + de/dt = q(t). Since the Lagrangian only depends on the 
velocity, it does not change, and we have a symmetry. Making use of (8.1 1), we find 

dL 3 L 3 L 

eQ=^rSq = —e — » Q = — = p. (8.16) 

dq dq dq 

We recognize Q as the momentum associated with the coordinate q. This quantity is con- 
served because the position q does not appear in the Lagrangian. Note that the conservation 
equation dQ/dt = 0 coincides with the Euler-Lagrange equation (8.13). This example 
illustrates a familiar result in Lagrangian mechanics: if the Lagrangian of a system does 
not contain a given coordinate, the momentum conjugate to that coordinate is conserved. 
For a free nonrelativistic particle, for example, L = \m(q) 2 . In this case Q — mq. 

Now let us consider Lagrangian densities with symmetries. While symmetries of a 
Lagrangian guarantee the existence of conserved charges, symmetries of a Lagrangian 
density guarantee the existence of conserved currents. We write the action as the integral 
of the Lagrangian density over the full set of coordinates t- a of some relevant “world”: 

S = j dfd^ x ...d^ k d a (p a ). (8.17) 

Here k denotes the number of space dimensions of the world. The world could be the 
full Minkowski spacetime, some subspace thereof, or, for example, the two-dimensional 
parameter space of the string world-sheet. The fields <p a ( £) are functions of the coordi- 
nates, and 

„ 3 <p a 

d a <P a = ^ (8.18) 

are derivatives of the fields with respect to the coordinates. Each value of the index a 
corresponds to a field. Consider now the infinitesimal variation 

</> fl (f) </» fl (f) + 30 fl (f), (8.19) 

and the associated variation for the field derivatives 3 a (p a . The infinitesimal variations are 
conveniently written as the rule 



Sc/> a = e' h“ (</»), 



( 8 . 20 ) 
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that enables us to vary any arbitrary field configuration. Here, e‘ are a set of infinitesimal 
constants, and, for brevity, we omitted all indices in the arguments of h'- . We have included 
an index in e' because the variation may involve several parameters. A spacetime transla- 
tion, for example, involves as many parameters as there are spacetime dimensions. Since 
the index i is repeated in (8.20), it is summed over. You must distinguish clearly the various 
types of indices we are working with: 

a index to label world coordinates f“, or vector components, 
i index to label parameters in the symmetry transformation, 
a index to label fields in the Lagrangian. (8.21 ) 



If £ is invariant under (8.19) and the associated variations of the field derivatives, then 
the quantities jf defined by 





e' i a - 8<b a 


(8.22) 






are conserved currents: 




o 

II 


(8.23) 



We will prove this shortly. Equation (8.23) holds for any field configuration that satisfies 
the Lagrangian equations of motion. In (8.22) the repeated field index a is summed over. 
If the index i is present in (8.20), then we have several currents indexed by i, one for 
each parameter of the variation. The components of the currents, as many as the number of 
dimensions in the world, are indexed by a. It is important not to confuse the very different 
roles that these two kinds of indices play: 

jf\ i labels the various currents, 

a labels the components of the currents. (8.24) 

We showed in Section 8.1 that a conserved current gives rise to a conserved charge. The 
charge is the integral over space of the zeroth component of the current. It follows that the 
currents /" give rise to the conserved charges 

Qi = J dS l dS 2 ...d^ k jf. (8.25) 

We have as many conserved charges as there are parameters in the symmetry trans- 
formation. 



Quick calculation 8.1 Verify that (8.23) implies that 



dQ, 

di ; 0 



when the currents y “ vanish sufficiently rapidly at spatial infinity. 



(8.26) 
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To prove (8.23), we write both the Euler-Lagrange equations associated with the action 
(8.17) (see Problem 4.8) and the statement of invariance: 



3 C 

30“ 



( 3£ )- — = 0, 
V3(3 „d> a )J 3 cb a 



dC 



^ 3(3 a<P a ) 
3 C 

3(3 a r) 



d a (S(p a ) = 0. 



(8.27) 

(8.28) 



Using the first equation to eliminate from the second, we get 



3 -(j^) sr + ^ 3 ‘ (sr) = 3 ‘( 



3 C 

3(3^°)' 



3 C 



^3(3 a <P a ) 



(8.29) 



which demonstrates that (8.23) holds for currents defined as in (8.22). 

We will use (8.22) to construct conserved currents which live on the string world-sheet. 
Actually, conserved charges and conserved currents exist under conditions less stringent 
than those stated in this section. A transformation can be considered a symmetry if the 
Lagrangian, or the Lagrangian density, if appropriate, is changed by a total derivative. 
These ideas are explored in Problem 8.9 and Problem 8.10. 



8.3 Conserved currents on the world-sheet 



To each string we would like to assign a relativistic momentum p /t which is conserved 
if the string is moving freely. Even though the momentum carries an index, it is not 
a current, but rather a charge. In fact, since each component of /? ;/ should be separately 
conserved, this is a case where we have a set of conserved charges. 

In the notation of the previous section, <2, denote the various charges, so we see that 
the /x index in p jL is playing the role of the i index in Q , ; it labels the various charges. 
What then is the a index in j “ ? We will see that this index labels the coordinates on the 
world-sheet. The currents live on the world-sheet! 

In the string action (6.39), the Lagrangian density is integrated over the world-sheet coor- 
dinates r and er, and not over the spacetime coordinates x jI . In this example, the world of 
(8.17) is two-dimensional, and the index a in (8.21) takes two values. As a result, the con- 
served currents live on the world-sheet: they have two components, and they are functions 
of the world-sheet coordinates. More explicitly, 

S = j oX^tXO, with (£°, f 1 ) = (r, a), (8.30) 

and d a = 3/3£“. Comparing with (8.17), we see that the field variables <p a are simply the 
string coordinates X 11 . Note that the string action only depends on derivatives of the string 
coordinates. 
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To find conserved currents, we need a field variation tiX 1 ' that does not change the 
Lagrangian density. One such variation is given by 



SX^ir ,er) = 



(8.31) 



where c 1 ' is a constant - that is, it does not depend on r or a . This transformation is a con- 
stant spacetime translation: each point on the world-sheet is displaced by the same vector 
c ,L . The Lagrangian density is invariant because it only depends on derivatives 9 cf X /i , and 
their variations vanish: S^aX 11 ) = 9 Q ,(<5X M ) = — 0- The role of the various indices 

in (8.21) is now clear: a is a world-sheet index, i is a spacetime index, and so is a. 

Now let us construct the conserved current. Using (8.22), and letting i and a indices run 
over the values of /x, we have 



e /x ■« = 
J u 



dc 
d{d a xn 



sx » = 



dC 

d(d a xn 



(8.32) 



Cancelling the common factor on the two sides of this equation, we find an expression 
for the currents: 



9£ 



d(d a Xn 



O', 



,-o 



j ii • 



/ 9£ 9£ \ 

- V 9X dX^' /' 



(8.33) 



We have seen such derivatives of C before; they appeared in equations (6.49) and (6.50). 
We can in fact identify 




This is really interesting: the r and er superscripts in V tl label the components of a world- 
sheet current. The equation for current conservation is 

9 p>c 

d a v “ = — ^ ± = 0. (8.35) 

^ 9r 9 ct 

This is simply the equation of motion (6.53) for the relativistic string. 

Since the currents V “ are indexed by /x, the conserved charges are also indexed by /x. 
Following (8.25), to get the charges we integrate the zeroth components V T of the currents 
over space. In the present case, this means integrating over cr: 

P^T)= r Vl{T,a)dG. (8.36) 

Jo 

This integral is done with r held constant. We have called the conserved charges p t , 
because they give the spacetime momentum earned by the string. Indeed, we have seen 
that these charges arise from spacetime translational invariance. Since (8.36) gives the 
total spacetime momentum as an integral over a string parameterized by a , we learn that 
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V T is the a density of spacetime momentum carried by the string. (8.37) 



V* is the derivative of the Lagrangian density with respect to the velocity X 11 , so it has 
the interpretation of a canonical momentum density, in agreement with (8.37). A similar 
identification was obtained for nonrelativistic strings: the quantity V’ , which we defined in 
(4.46), has the interpretation of momentum density on account of (4.43). 

To check conservation, we differentiate (8.36) with respect to r and use the equation of 
motion (8.35): 



dp, i 
dr 



P^ d a = -f 
Jo 3 r Jo 



*1 dV° 



da 



da = -V° 



(8.38) 



For a closed string, the coordinates a = 0 and a = a\ represent the same point on the 
world-sheet, so the right-hand side vanishes. For an open string with free endpoints, the 
boundary condition (6.56) states that V ° vanishes at the endpoints, so, again, the right-hand 
side vanishes. In both of these cases, p jt is conserved: 



dpy, 

dr 



= 0 . 



(8.39) 



There is more to this equation than our previous statement (8.12) of charge conservation. 
The derivative in (8.39) is with respect to r and not with respect to t. So we may ask: is 
p jL conserved in world-sheet time or in Minkowski time? We will discuss this question at 
length in the following section. The short answer is: in both. 

For open strings with Dirichlet boundary conditions along some space directions, the 
momentum carried by the string along those directions may fail to be conserved. Indeed, 
the boundary condition (6.55) does not guarantee that the right-hand side of (8.38) van- 
ishes. We already noted this possible nonconservation for nonrelativistic strings (Section 
4.6). In open string theory, Dirichlet boundary conditions appear when we have D-branes 
that are not space filling. The momentum of the string can then fail to be conserved, but 
the total momentum of the string and the D-brane is conserved. 



( 



8.4 The complete momentum current 



Equation (8.39) is very intriguing. It is a conservation law on the world-sheet rather than 
in spacetime. We could not have expected otherwise - the currents, after all, live on the 
world-sheet: their indices are world-sheet indices, and their arguments are world-sheet 
coordinates. As seen in spacetime, the currents vanish everywhere except on the surface 
traced out by the string. 

If we trust the reparameterization invariance of the physics, we can easily obtain a stan- 
dard spacetime conservation law by choosing the static gauge t = r. The integral in (8.36) 
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is then an integral over the strings, the lines of constant time as seen by the chosen Lorentz 
observer. The conservation law (8.39) becomes 



^=0. 


(8.40) 


dt 





The Lorentz observer confirms that momentum is conserved in time. 

Equation (8.39), together with (8.36), tells us that for any fixed world-sheet parameteriza- 
tion, we can compute a unique quantity p tl using any line of constant r . This quantity must 
coincide with the time-independent momentum p M obtained using the static gauge, as we 
now explain. 

Consider a moving string, a fixed Lorentz observer, and a particular choice of parame- 
terization on the world-sheet. In this parameterization, over some region of the world-sheet 
the lines of constant r are lines of constant t. Asa result, over this region we have the static 
gauge. Over the rest of the world-sheet the parameterization changes smoothly: the lines 
of constant r are no longer lines of constant time. On the static gauge region the integral 
(8.36) gives us the time-independent momentum carried by the strings. Because of (8.39), 
on the rest of the world-sheet the integral (8.36) must still give the same value for p t , , even 
though the lines of constant r are not strings. 

This argument suggests that any curve on the world- sheet can be used to calculate the 
conserved momentum p^. Equation (8.36), however, only tells us how to calculate the 
momentum if the curve is a curve of constant r. We now show how to generalize (8.36) to 
be able to compute the momentum p jt using an (almost) arbitrary curve on the world-sheet 
together with an arbitrary parameterization of the world-sheet. When we deal with open 
strings, the curve must stretch from one boundary to the other. When we deal with closed 
strings, the curve must be a closed noncontractible curve. 

Reconsider equation (8.36), which integrates the r-component of the two-dimensional cur- 
rent ( V *, V " ) over a curve of constant r. The quantity that this integral computes is 
actually the flux of the current across the curve. Since the a -component V" of the cur- 
rent is tangent to the curve of constant r, it does not contribute to the flux. More generally, 
consider an infinitesimal segment (dr, do) along an oriented closed curve T that encloses 
a simply connected region 1Z of the world-sheet (Figure 8.1). Since (dr, do) is parallel 
to the oriented tangent, the outgoing normal to the segment is (do, —dr). It is reasonable 
to define the outgoing flux of the current across the segment as the scalar product of the 
current vector and the outgoing normal vector: 

Infinitesimal flux = (V^, V °) ■ (do, —dr) = 'Pj l do — V° dr. (8.41) 

We now show that the flux, so defined, vanishes when computed across a contractible 
closed curve T on the world-sheet. This is a reasonable result because a contractible curve 
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Left: the total momentum flux out of a simply connected region 1Z of the world-sheet is 
zero. Right: a simply connected region whose boundary includes an arbitrary curve y and a 
string y. 



surrounds a domain 7Z , and we do not expect a domain to be a momentum source or sink. 
The outgoing flux across T is written as 

p„(T) = j)^(VldcT-Vldr). (8.42) 

By the two-dimensional version of the divergence theorem, the flux of the current V “ out 
of TZ is equal to the integral of the divergence of over 7 Z: 

r / dV T dV a \ 

p,i(r)= L\^ + ^ L ) drd(j =0, (8 ' 43) 

since V “ is a conserved current. This is what we wanted to show. 

Quick calculation 8.2 Consider R 2 with coordinates (x, y ) and a simply connected region 
M surrounded by a boundary T with counterclockwise orientation. The divergence theorem 
for a vector (A x , A y ) reads 

/ v v rr ,dA x 9A>\ 

(D (A dy — A y dx) = I I ( 1 )dxdy. (8.44) 

Jr ' JJm v d* 3y > 

Verify that this equation holds for a tiny rectangle with corners (xo, yo), (xo + dx, yo), 
(xo + dx, yo + dy), and (xo, yo + dy). This suffices to prove (8.44) by breaking M into 
a collection of tiny rectangles. We used (8.44), for an R 2 with coordinates (r, a) and a 
vector ( V to pass from (8.42) to (8.43). 

We now generalize (8.36) as follows. For an arbitrary curve y that starts on the a — 0 
boundary of the world-sheet and ends on the a = cq boundary, we define 

P*(Y)= f{K d °-K d *). (8-45) 

Jy 

If y is a curve of constant r, then dr = 0 all along y, and p^iy) reduces to (8.36). We 
now prove that p^iy), as defined in all generality by (8.45), actually coincides with /y , , as 
defined in (8.36). Consider a curve y that stretches from one side of the world-sheet to the 
other, and a curve y of constant r , as shown to the right of Figure 8.1. Let a and fi denote 
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A closed string world-sheet with an arbitrary nontrivial closed curve y and a closed curve y 
at constant r. The region between the curves is 1Z. 



oriented paths along the world-sheet boundary such that the closed curve T surrounding 
the shaded region is given by 

r — y — P — y + a. (8.46) 

The curve T is oriented counterclockwise and it is contractible. As a result, the flux /? /; (F) 
vanishes: 

= f r (VI do - v; dr) = (J - 1 + jf - Q (VI do - VI dr) = 0. (8.47) 

Since a and are curves where do vanishes, only V" dr contributes to the integrals. 
But V ° vanishes at the string endpoints (for free endpoints), so these integrals vanish 
identically. Only the integrals over y and y survive, so we have 

[ (K' da - K dr) = f (K do - v; dr) = f VJ t do = p„ (8.48) 

J y «/ y J y 

where we noted that dr = 0 on y and used (8.36). This proves that Ppiy) = /y, for any 
contour y which connects the o — 0 and o — o\ boundaries of the world-sheet. We can 
thus rewrite (8.45) as 



/V = f ( V l do - VI dr) . 


(8.49) 


Jy 





Conservation is now the statement that the integral above is independent of the chosen 
curve y, as long as the endpoints of y lie on the boundary components of the world-sheet. 

The arguments are similar in the case of closed strings. We consider an arbitrary nontriv- 
ial closed curve y winding once around the world-sheet, and another similarly nontrivial 
closed curve y of constant r, for some arbitrary, but fixed, parameterization. The two 
curves form the boundary of an annular region 'R, (Figure 8.2). A completely analogous 
argument shows that both contours give the same result for p M . Therefore, we can calcu- 
late the momentum of the closed string using any closed curve that winds once around the 
world-sheet. 

How does an arbitrary Lorentz observer use equation (8.49)? The observer looks at a string 
at some time t, and asks for its momentum. This requires the use of (8.49), with a curve y 
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which corresponds to the string in question. With an arbitrary parameterization y need not 
be a constant r curve. At some later time t' , the observer asks again for the momentum. At 
this time, the string corresponds to a curve y' , genetically different from y. By the curve 
independence of (8.49), the observer concludes that the momentum did not change. The 
momentum was conserved in time. 

We conclude this section by making contact with familiar results. Using a constant r 
string y to evaluate (8.49) we have 

p° = I V r0 do, p= j V T do. (8.50) 

J y J y 

The values of the momentum densities in static gauge can be read from (6.94). We thus 
find that the energy and the spatial momentum of the string are given by: 



P 



o _ 





Tods 





(8.51) 



The above equation for the energy of the string coincides with (7.9). The expression for 
the momentum is reasonable: the momentum carried by a little piece of string is given by 
its rest mass Tods /c 2 times its velocity, times its relativistic factor y . 



8.5 Lorentz symmetry and associated currents 



By construction, the action of the relativistic string is Lorentz invariant. It is written in 
terms of Lorentz vectors that are contracted to build Lorentz scalars. Concretely, this means 
that Lorentz transformations of the coordinates X p leave the action invariant. In this section 
we will construct the conserved charges associated with Lorentz symmetry. 

These charges will be particularly useful when we study quantum string theory in 
Chapter 12. Whenever we quantize a classical system, there is the possibility that cru- 
cial symmetries of the classical theory will be lost. If Lorentz invariance were lost upon 
quantization, quantum string theory would be very problematic, to say the least. We will 
have to make sure that the quantum theory is Lorentz invariant. To calculate the conserved 
charges, we first need the infinitesimal form of the general Lorentz transformations. We 
recall (Section 2.2) that Lorentz transformations are linear transformations of the coor- 
dinates X p that leave the quadratic form r] /JlV X fl X v invariant. Every infinitesimal linear 
transformation is of the form X 11 — > X 1 ' + 8X P , where 

8X p = e p ' v X v . (8.52) 

Here e pA! is a matrix of infinitesimal constants. Lorentz invariance imposes conditions on 
the constants e /iv . We require S(rj llv X IJ 'X v ) = 0, and therefore 

2 r ll , v (8X^X v = 2r, llv (f pp X p ) X v = 2^ p X p X IM = 0. (8.53) 

Imagine decomposing the matrix e into its antisymmetric part and its symmetric part. The 
antisymmetric part would not contribute to e ,lp X p X p . The vanishing of c pp X p X p , for 
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all values of X ll: implies that the symmetric part of e is zero. It follows that the general 
solution is represented by an antisymmetric e /iv : 

€/*" = -e v/i . (8.54) 

Infinitesimal Lorentz transformations are thus very simple: they are transformations of 
the form tiX 1 ' = € ,lv X v , with e ,il! antisymmetric. This result applies for any number of 
spacetime dimensions since Lorentz transformations always leave r] flv X fl X v invariant. 

Quick calculation 8.3 Consider a fixed 2-by-2 matrix A ab ( a,b= 1,2) that satisfies 
A ab v a Vb = 0 for all values of v\ and i> 2 . Write out the four terms on the left-hand side of 
the vanishing condition, and confirm explicitly that the matrix A ab must be antisymmetric. 

Quick calculation 8.4 Repeat the above exercise for a 4-by-4 matrix e /I1 ' (ji, v = 
0, 1, 2, 3) that satisfies e /J ’ v v /J ,v v = 0 for all values of vq, Vi, i>2, and 1)3. 

Quick calculation 8.5 Show that (8.54) implies e^ v — — e V|Lt . 

Quick calculation 8.6 Examine the boost in (2.36) for very small /3 . Write x' ,L = x 11 + 
x llv x v and calculate the entries of the matrix e llv . Show that e 10 = — e 01 = ft, and that all 
other entries are zero. This confirms that e liv is antisymmetric for an infinitesimal boost. 



Let us now show explicitly that the string Lagrangian density is invariant under Lorentz 
transformations. All terms that appear in this density are of the form 



dX » dX v 
^ 3£“ 9p" 



(8.55) 



where and ^ are either r or a . We claim that any such term is Lorentz invariant. Indeed, 



^ dX^dX v ^_ ( d&X* 1 dX v | 3 X^dSX v \ 
S y lfiv ~d^~~d^') ~ ^ v v d$ a + 

M 6 3£“ dt-P + 3£“ 3 $p) 

3 XP dx v | __ 3Z M 3 XP 
~ €v p +e w a ^ a 



where we used rj to lower the first index on the f constants. Letting /x — >• p and p — > v in 
the second term we get 



( dXP dX v \ ^ , ^dXP dX 

yl/xv ) — (Cyp + Cpv) ~ 



because of the antisymmetry of e. This explicitly proves the Lorentz invariance of the string 
action. 



We can now use equation (8.22) to write the currents. It follows from (8.52) that the role 
of the small parameter e' is played by f / ' u . We therefore have 
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The current j“ v , since it is multiplied by the antisymmetric matrix e ,lv , can be defined to 
be antisymmetric - any symmetric part would drop out of the left-hand side. Using the 
antisymmetry of U'L the right-hand side is written as 

^ v j“ u = i-yn - *vv;) . (8.59) 

The currents can be read directly from this equation because the factor multiplying c ,LV 
on the right-hand side is explicitly antisymmetric. Since the overall normalization of the 
currents is for us to choose, we define the currents A4“ y by 

M% = X IjL P« - X V P«. (8.60) 



By construction, 

M a = -M a 

J 1 nv J l V/l’ 

The equation of current conservation is 

dM% 

dr + da 

and the charges, in analogy with (8.49), are given by 



{M^da - M” v dr) . 


(8.63) 


Jy 





(8.61) 



(8.62) 



The charges, just like the currents, are antisymmetric: 

M„ v = -M v „. (8.64) 

The conservation of M [xv is a result of the contour independence of the definition (8.63). 
For closed strings, contour independence is guaranteed by the argument given earlier in the 
case of momentum charges. For free open strings, one point must be addressed to ensure 
contour independence. The M jLV integrals must receive no contributions from lines on the 
boundary of the world-sheet. This, in turn, requires the vanishing of on the boundary. 
This condition is satisfied because A4" y involves V° multiplicatively, and V n vanishes on 
the world-sheet boundary. As explained for the case of the momentum charges, a Lorentz 
observer measuring M jLV using strings at different times will conclude that d M llv /dt = 0. 
We can also compute the Lorentz charges M /l v using constant r lines. In that case. 



M^v = 



= J M x lJtv {x,(T)dcT = J ( 



{XuVl - X , 



VI) da. 



(8.65) 
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Since the M^ v are antisymmetric, we have six conserved charges in four dimensions. Let- 
ting i and j denote space indices. Mo,- are the three charges associated with the three basic 
boosts, and M (/ are the three charges associated with the three basic rotations. Given that 
V T is the momentum density, the normalization chosen in ( 8 . 60 ) ensures that A 4 J- is pre- 
cisely the angular momentum density. As a consequence, the components Mjj measure 
the string angular momentum L via the usual relations L, = Kijk^d jk- Here is the 
totally antisymmetric symbol which satisfies e 123 = 1 . More explicitly, we have L \ = M23, 
L2 = M31, and L3 = M\2- 

The charges associated with the boosts are: 

M 0i = J do(ctV zi - X ! V r0 ) = ct p‘ - J dorr' 0 . (8.66) 

Multiplying by c/E, where E is the conserved energy of the string. 



cM°‘ c 2 p 

= t 

E E 



J doX‘ cT r0 . 



( 8 . 67 ) 



Since cV T ° is the energy density along the string, the last term on the above right-hand side 
can be identified with the time-dependent position X' cm (t) of the center of mass (energy) 
of the string. We thus obtain. 



Km (0 = 



cM' 



0i 



+ t 



C 1 



(8.68) 



The quantity c 2 p' /E has the interpretation of center of mass velocity because it coincides 
with the velocity of a point particle with momentum p' and energy E. Equation (8.68) 
describes the motion of the center of mass. The conserved charges M 0 ', together with E, 
determine the t — 0 position of the center of mass. 



8.6 The slope parameter a' 



The string tension To is the only dimensionful parameter in the string action. In this section 
we will motivate the definition of an alternative parameter: the slope parameter a 1 . These 
two parameters are related, we can use one or the other. The parameter a 1 has an interesting 
physical interpretation, used since the early days of string theory. If we consider a rigidly 
rotating open string, a! is the proportionality constant that relates the angular momentum 
J of the string, measured in units of h, to the square of its energy E. More explicitly, 

- = a'E 2 . ( 8 . 69 ) 

n 

Since the left-hand side has no units, the units of a ’ are those of inverse energy-squared: 



( 8 . 70 ) 
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The appearance of ft in (8.69) is just a convention. The constant a! was introduced in the 
quantum theory of strings, and its relation to the string tension involves ft. For our present 
purposes, however, the important fact is just the proportionality between J and E 2 . This 
relation does not involve ft when we use the string tension Tq. 



To verify the proportionality implied in equation (8.69), we consider a straight open string 
of energy E which rotates rigidly on the (x, y ) plane. This is precisely the problem exam- 
ined in Section 7.4. The only nonvanishing component of angular momentum is A7| 2 , and 
its magnitude is denoted by J = \Mu\- Equation (8.65) tells us that 

M U = ( \xi V r 2 -X 2 Vl)da. (8.71) 

Jo 

To evaluate this integral we need formulae for the position and momenta of the rotating 
string. We recall equation (7.63), 



- cr\ no ( net nct\ 

X(t, a) = — cos ( cos , sin 1, 

n <Ji V a i <7i J 



(8.72) 



which records the components {X\,X 2 ) of the rotating string. Using equation (7.31), 
we find 



TodX Tq no/ net nct\ 

= — — = — cos 1 — sin , cos ) , 

- dt C <71 V CT 1 <71/ 



(8.73) 



where the right-hand side gives the components (' P[, VJ ) . The integral in (8.71) is thus 
given by 



<7i T 0 



nai 

M \ 2 = — — I cos z — do = 

n c Jo oi 2nc 



no 



<7f To 



(8.74) 



The time dependence disappeared, as expected for a conserved charge. Since J = M \ 2 
and o i = E/ Tq, we have found that 



J = 



1 

2nToc 



E 2 . 



(8.75) 



As anticipated, the angular momentum is proportional to the square of the energy of the 
string. Comparing with equation (8.69) we deduce that 



/ 

a 



1 

2nTo ftc 



and 



To = 



1 

2na' ftic 



(8.76) 



These equations relate the slope parameter a' to the string tension Tq. 

Quick calculation 8.7 To appreciate how unusual the relation J ~ E 2 is, consider a one- 
dimensional straight bar of fixed length and fixed (uniform) mass that is rotating around 
its midpoint. Show that the nonrelativistic energy and the angular momentum are related 
by J ~ y[E. 
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One may have anticipated the relationship / ~ E 2 by the following admittedly rough argu- 
ment. We know that for rigid rotations J = /&>, where I is the moment of inertia. For an 
object of mass M and length scale L, we have I ~ ML 2 . So, J ~ ML 2 co. For a rotating 
relativistic string M ~ E, L ~ E, and co ~ l/E (see (7.57)). As a result J ~ E 2 . 

The name slope parameter arises because a' is the slope of the lines of J /h, when plotted 
as a function of energy- squared. In fact, Regge trajectories are approximate lines that arise 
when plotting angular momentum as a function of energy-squared for hadronic excitations. 
In the early 1970s, when string theory was investigated as a theory of strong interactions, 
the slope parameter a' was the experimentally measured quantity that entered into the 
string action. The action (6.39), which we wrote in terms of To, takes the form 

S = - 2jf J hc2 J* f dr jP da J (X ■ X') 2 - (X) 2 (X ') 2 ■ (8-77) 

It turns out that most of the modern work on string theory uses the slope parameter a ' 
as opposed to the string tension 7o. In Section 3.6 we used h. c, and Newton’s constant G 
to calculate a characteristic length Ip called the Planck length. In string theory, we can use 
h, c, and the dimensionful parameter a' to construct a characteristic length l s called the 
string length. 

Quick calculation 8.8 Show that 



l s = he \fa! . 



(8.78) 



Up to factors of h and c, the string length l s is the square root of a'. This connection 
to a fundamental length scale provides an alternative physical interpretation for the slope 
parameter a 1 . 



Problems 



Problem 8.1 Lorentz invariants and their densities. 

We want to understand how Lorentz invariance of electric charge implies that charge den- 
sity is the zeroth component of a Lorentz vector. 

Consider two Lorentz frames S and S' with S' boosted along the +x direction of S with 
velocity v. In frame S we have a cubic box of volume L 3 with edges, all of length L, 
aligned with the axes of the coordinate system. The box is at rest and is filled by a material, 
also at rest, that has uniform charge density po- In this frame the current density vanishes 
j = 0. Note that S' sees the box Lorentz contracted along the x direction. 

(a) Use the Lorentz invariance of charge to calculate the charge density and current density 
( cp ' , j') that describe the material in the box according to frame S'. Verify that the 
values (epo, 0) and {cp' , j ') behave as four-vectors j ^ = (epo, 0) and j" 1 = {cp', j '), 
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in the sense that the quantities (cp ' , j') are obtained from the rest frame values (cpo, 0) 
by a Lorentz boost. 

(b) Write a formula for the current j ^ in terms of po and the four- velocity of the box, and 
check that your result works both for frames S and S ' . 



Problem 8.2 Lorentz invariance of electric charge. 



The explicit test of Lorentz invariance of electric charge is quite subtle, so this is a chal- 
lenging problem! We consider, as usual, a frame S and a frame S' moving along the 
+x direction of S with velocity v. In frame S the total charge is given by integration over 
all of space at a fixed time t = 0: 

Q = J d 3 x p(t— 0, x), 

and in the primed frame S' it is given by an integral over all of space at t' = 0: 

Q' = J d 3 x' p'(t'=0,x'). 

We want to prove that Q' = Q. The complication is that S and S' do not agree on what is 
equal time. We will assume that j a — ( cp , j) is a four-vector and it is conserved: d a j a = 0. 
Moreover, charge densities and currents are taken to vanish at infinity for all times. 



(a) Write explicitly the transformations that relate ( t , x) to (t ' , x') and (cp, j) to (cp ' , j'). 
Think of p and j as functions of arguments (f, x, y, z). Prove that 



Q 






Y 



vx , , , 

Y~ 2 ~. YX ,y ,z 



)J 



VX ill 

y ~c2’ yx ’ z 



To make Q' look more like the expression for Q, change integration variables using 
x = yx', y = y' , and z = z' ■ Write the resulting expression for Q' . Confirm that for 
v — 0 you get Q' = Q. 

(b) The strategy is to show that Q' is independent of the velocity v. In that case, the 
equality of Q' and Q for v = 0 implies the equality of Q' and Q for all velocities. 
Show that the derivative of Q' with respect to the velocity can be written as 



dQ' 

dv 




■ (dp 




L U 


c 2 3 1 J 





,x,y,z 



( 1 ) 



(c) Use the conservation equation 3 a j a = 0 to show that the integrand in (1) is a total 
derivative. It then follows from our conditions at infinity that = 0, completing the 
proof that Q' = Q. You may find the following equation useful: 



(d . X (vx \\ [ v dj x dj x 



(2) 



where the subscripts y, z on the left-hand side indicate that the partial derivative is 
taken with these variables (and not t ) held fixed. Equation (2) is just a statement about 
derivatives and should be clear if you understand the notation. 
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Problem 8.3 Angular momentum as a conserved charge. 

Consider a Lagrangian L that depends only on the magnitude of the velocity q(t) of a 
particle which moves in ordinary three-dimensional space. 

(a) Write an infinitesimal variation Sq(t) that represents a small rotation of the vector q(t). 
Explain why it leaves the Lagrangian invariant. 

(b) Construct the conserved charge associated with this symmetry transformation. Verify 
that this conserved quantity is the (vector) angular momentum. 

Problem 8.4 A generalization for charges and a special case for currents. 

(a) Generalize equations (8.9) and (8.1 1) to the case in which there are various symmetry 
parameters e l and a set of coordinates q a . Verify that your charges are conserved. 

(b) Consider the setup of (8.17) for a “world” with no spatial dimensions. What are the 
possible values of the index a? What do equations (8.22), (8.23), and (8.25) give? 
Compare with your results in part (a). 

Problem 8.5 Lorentz charges for the relativistic point particle . ' 

Consider the point particle action (5.15): S = — me f drJ—ri llv xi J -x v , where i M (r) = 
dx^(j)/dr. This action can be treated as a mechanics action where x ,L is a coordinate 
and r plays the role of time, or as a field action where x 1 ' is a field in a world with no 
spatial dimensions and with = r. 

(a) Show that x^ix) -> x M (r) + e 11 , with constant, is a symmetry. Find the con- 
served charges associated with this symmetry and verify explicitly their conservation. 
Compare with the momenta ( r ) defined canonically from the Lagrangian. 

(b) Write the infinitesimal Lorentz transformations of x ii (t), and explain why the action 
is invariant under such transformations. 

(c) Find an expression for the Lorentz charges in terms of x^(r) and p ll (x). Make sure 
that your Lorentz charges coincide with angular momentum charges for the appropriate 
values of the indices. Verify explicitly their conservation. 

Problem 8.6 Simple estimates regarding a', 7o, and l s . 

(a) In hadronic physics a ’ — 0.95 GeV -2 . Calculate the hadronic string tension in tons 
and the string length in centimeters. 

(b) Assume that the string length is l s ~ 10 -30 cm. Calculate a' in GeV -2 and the string 
tension in tons. 



Problem 8.7 Angular momentum of a rotating string. 



Equation (8.51) tells us that the momentum dp carried by a small piece of string is 
given by 



„ T 0 ds 

dp = — 5 - 

c z 
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Use this result to calculate by direct integration the angular momentum J carried by a 
rotating open string. Make sure that your result coincides with (8.75). 

Problem 8.8 Angular momentum of Kasey’s jumping rope. 

Consider the relativistic jumping rope solution of Problem 7.4. Calculate the magnitude 
/, of the z-component of angular momentum (measured with respect to the origin) as a 
function of the separation Lq and the angle y. Show that J z /ti = (sin 2 y)a' E 2 , where E is 
the energy of the string. 

Problem 8.9 Generalizing the construction of conserved charges. 

We reconsider the setup that led to the conserved charge Q defined in (8.11). Assume 
now that the transformation (8.8) does not leave the Lagrangian L invariant, but rather the 
change is a total time derivative: 

5L = ^(€ A), (1) 

at 

where A is some calculable function of coordinates, velocities, and possibly of time. Show 
that there is a modified conserved charge which takes the form 

3L 

e Q = — Sq - e A. (2) 

aq 

Since it leads to a conserved charge, a transformation that changes L by a total derivative 
is said to be a symmetry transformation. 

As an application consider a Lagrangian L(q(t),q(t)) that has no explicit time 
dependence. A transformation of the form 

q(t) — * q(t + €)~q(t) + eq(t), (3) 



represents the effect of a constant infinitesimal time translation. Show that the transforma- 
tion (3) is a symmetry in the sense of (1). Calculate A and construct the conserved charge 
Q . Is the result familiar? 

Problem 8.10 Generalizing the construction of conserved currents. 



Assume that the transformation (8.19) does not leave the Lagrangian density C invariant, 
but rather the change is a total derivative: 



SC = — (e* A“), 
3 £“ ‘ 



( 1 ) 



where the A" are a set of calculable functions of fields, field derivatives, and possibly 
coordinates. Show that there is a conserved current which takes the form 



e Ji 



3 C 

3(d a r) 



8<p a - € l A“ 



( 2 ) 



Since it leads to a conserved current, a transformation that changes £ by a total derivative 
is said to be a symmetry transformation. 
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As an application consider a Lagrangian density C(<p a , 3 a <P a ) that has no explicit 
dependence on the world coordinates f The transformation 

<t> a {S p ) — ► 4> a (£ p + — 4> a + c p dp<l> a , (3) 

where is an infinitesimal constant vector, represents the effect of a constant translation. 
Show that (3) is a symmetry in the sense of (1), with 



A“ = S? p C. 

Show that the conserved currents j a p take the form 

•a _ ^ 

J P~ 



3(3 a<P a ) 



8“C. 



(4) 



( 5 ) 



Is j q familiar? The quantities j a p actually dehne the energy-momentum tensor T a p . 



c 




A class of gauges is introduced that fix the parameterization of the world-sheet, 
lead to a pair of constraints, and give equations of motion that are wave equa- 
tions. One of these gauges, the light-cone gauge, sets X + proportional to r and 
allows for a complete and explicit solution to the equations of motion. In this 
gauge, the dynamics of the string is determined by the motion along the trans- 
verse directions and the values of two zero modes. We encounter the classical 
Virasoro modes as the oscillation modes of the X~ coordinate, and we learn 
how to calculate the mass of an arbitrary string configuration. 



Our first encounter with classical string dynamics was simplified by our use of the static 
gauge. In this gauge the world-sheet time r is identified with the spacetime time coordinate 



We are now going to consider more general gauges. Among the class of gauges that we 
will examine, one of them, the light-cone gauge, will turn out to be particularly useful. 
Using this gauge we will solve the equations of motion of the string in a complete and 
explicit fashion. Our solution in the static gauge was not fully explicit; the motion was 
characterized by a constrained vector function (see (7.48)). 

The gauges we will focus on are those for which r is set equal to a linear combination of 
the string coordinates. This condition can be written as 



If we choose = ( 1,0,..., 0) and ’/. = c, this equation becomes (9.1). To understand the 
meaning of (9.2), we introduce the related equation 



9.1 A class of choices for r 



X° by 



X°(r, cr) = cr. 



(9.1) 



X^(x, cr) = Xx. 



(9.2) 




(9.3) 



where we write x^, as opposed to X 1 ' , to emphasize that we are dealing with the general 
spacetime coordinates. Consider now the two equations above with the same fixed value 
of r. If x^ 1 and x% are two points which satisfy (9.3), then « M (x^ — x^) = 0. This shows 




176 



Light-cone relativistic strings 




The gauge condition n ■ X = k r fixes the strings to be the curves at the intersection of the 
world-sheet with the hyperplanes orthogonal to the vector n^. 



that any vector that joins points in the space (9.3) is orthogonal to the vector n tl . The set of 
all points which satisfy (9.3) forms a hyperplane normal to n 1 '. 

We can now make clear the meaning of equation (9.2). The points X^ that satisfy 
iiji X 1 ' = at are points that lie both on the world-sheet and on the hyperplane (9.3). Equa- 
tion (9.2) states that all these points must be assigned the same value r of the world-sheet 
time. If we define a string to be the set of points X ll (r. cr) with constant r, in the gauge 
(9.2), strings lie on hyperplanes of the form (9.3). The string with world-sheet time r is the 
intersection of the world-sheet with the hyperplane n ■ x — at, as illustrated in Figure 9.1. 

We want strings to be spacelike objects. More precisely, the interval AX' 1 between any 
two points on a string should be spacelike, perhaps null in some limit, but certainly never 
timelike. We will now see that in the gauge (9.2), a timelike n 11 guarantees that the string 
is spacelike. In this gauge any interval AX 1 L along the string satisfies n ■ AX = 0. Since 
this condition is Lorentz invariant, it can be analyzed in a Lorentz frame where the only 
nonzero component of n 1 ' is the time component. In this frame AX 1 ' cannot have a time 
component. It is therefore a spacelike vector. 

If n ^ is a null vector ((1, 1, 0, . . ., 0), for example), one can show that n ■ AX = 0 
implies that AX 1 ' is generally spacelike and occassionally null (Problem 9.1). We will 
allow to be null in (9.2). This choice can be viewed as the limit of a sequence of choices 
all of which involve a timelike n 1 ' . 

The gauges defined by (9.2) are not Lorentz covariant gauges for any choice of n 1 '. 
Choosing n 1 ' selects a particular linear combination of spacetime coordinates to be set 
equal to r. There is no linear combination of coordinates that is left invariant by arbitrary 
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Lorentz transformations. Therefore, the gauge condition takes different forms in different 
Lorentz frames - the gauge is not Lorentz covariant. 

At this point, it is convenient to streamline the way that we deal with units. While we have 
been using the convention that r and a have units of time and length, respectively, starting 
now r and cr will be dimensionless. We proved in Chapter 8 that for open strings with 
free endpoints, there is a well defined conserved momentum p^. We will use this Lorentz 
vector to rewrite our gauge condition (9.2) as follows: 

n ■ X(r, cr) = k (n ■ p) r. (9.4) 

Since n ■ pis a constant, the net effect is that we have traded k for another constant k. 
When open strings are attached to D-branes not all components of the string momentum 
are conserved. Since we want our analysis to hold even in this case, we will assume that 
the vector n 11 is chosen in such a way that n ■ p is conserved. This condition is weaker 
than the condition of momentum conservation. We will assume that n ■ V n = 0 at the 
open string endpoints since this condition naturally guarantees the conservation of n ■ p 
(consider equation (8.38) dotted with n 1 '). 

By involving the vector n 11 explicitly on both sides of equation (9.4), the scale of n 1 ' 
has been made irrelevant. Only the direction of n 1 ' matters. Bearing in mind that n ■ X has 
units of length, n ■ p has units of momentum, and r is dimensionless, imagine dividing 
both sides of this equation by the unit of time. We then see that k has units of velocity 
divided by force. The canonical choice for velocity is c, and the canonical choice for force 
is the string tension Tq. It is therefore natural to set 

k ~ — = 2na' he 1 , (9.5) 

?o 

where we used (8.76) to relate the string tension to a' . Before fixing k precisely, let us 
further simplify our treatment of units. 

At stake is our ability to track the units of different physical quantities. We can simplify 
this matter by deciding to track just one unit, instead of the three units of length, time, and 
mass. By convention, we do this by setting two of the fundamental constants equal to one: 

h=c= 1, (9.6) 

as if these constants had no units! This has two consequences. First, any h or c in our 
formulae disappears without leaving a trace. This is not a serious problem, if we know the 
full units of an expression where h and c have been set equal to one, we can reconstruct 
the dependence on h and c unambiguously. Second, the units become dependent, and we 
are left with just one independent unit. Since [c] = L/T , the condition c = 1 implies that 

L = T. (9.7) 

At this stage [ h ] = ML 2 /T becomes [/i] = ML. With h = 1 we get 



M = 1/L . 



(9.8) 
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Thus we can write all units in terms of mass or length (nobody uses time!). We will say 
that we are working with natural units when we set h = c = 1 and track just one unit. 
Back to (9.5), the units of a' have now become 



[«'] = 



1 




(9.9) 



While the complete units of a 1 are those of inverse energy-squared, we see that in natural 
units, a 1 has units of length-squared. This is in agreement with our result in (8.78). In 
natural units the string length is 




For reference, in natural units the Nambu-Goto action (8.77) takes the form 



S = ~ i / f dr i 1 d(Ty l (jC ' X ' )2 - {±)2{X>)2 ■ (9 - U) 

In natural units equation (9.5) sets X proportional to a ’ . For open strings we choose X = 
2a ' , and equation (9.4) becomes 

n ■ X(r , <r) = 2 a' (n ■ p ) r (open strings). (9.12) 

This is the final form of the gauge condition which fixes the r parameterization of the 
world-sheet. 

When we use natural units, length scales can be expressed in terms of mass or energy 
scales. Given a length l we can construct a unique mass m using only l, ti, and c. This 
unique mass is m — h/(lc), or equivalently, me 2 — tic /i. For quick estimates we use he — 
200 MeV x 10“ 15 m. 

Quick calculation 9.1 Show that the energy equivalent of the length 10~ 18 cm of a large 
extra dimension is roughly 20 TeV (1 TeV= 10 12 eV). 



9.2 The associated a parameterization 



Having fixed the r parameterization, let us determine the appropriate a parameterization. 
In the static gauge the a parameterization was fixed by the condition of constant energy 
density V T ° over the strings (the curves of constant r). Indeed, in setting A(er) in (7.17) 
equal to one by a suitable choice of a, we made V z0 in (7.6) constant. Since the static 
gauge uses n ^ = (1, 0, . . ., 0), we were actually demanding the constancy of n^V 111 . 





179 



9.2 The associated a parameterization 



The proper generalization to situations where n 11 is arbitrary is to demand the con- 
stancy of P TI ‘ = n ■ V z over the strings. Additionally, we require a parameterization 
range a e [0, jt] for all open strings. To show that we can satisfy these conditions we 
examine how P z,l (t, a) transforms under a reparameterizations. Looking at (6.93) we 
note that the rightmost fraction is invariant under a reparameterizations, and only the 
derivative ^ transforms. It follows that given two parameterizations a and <r of a string 
we have 

P T/i (r,a) = ^ n ■ V T (r, a) = n ■ V r (r, a). (9.13) 

da da 

If we are handed a string with a parameterization in which n ■ V T {r , a) depends on a, 
we can choose a a parameter so that n ■ V T (r, a) does not depend on a. This is sim- 
ply done by adjusting the value of ^ so that the final right-hand side above is set equal 
to a number that can only depend on r. We then do a further reparameterization that 
scales o’ by a constant factor b: a — > ba. This preserves the a -independence of n ■ V z 
but allows us to have a e (0, jr], by adjusting b suitably. In this final parameterization 
we have 

n ■ V t {t, a) = a(r), (9.14) 



where a(r) is some function of r. In fact air) cannot depend on r, and its value is already 
fixed by the conditions we have imposed. Integration of (9.14) over the string a e [0, n] 
gives 




da n ■ V T (t, a) = n ■ p = jra(r) 



a(r) 



n ■ p 
n 



(9.15) 



Since n ■ p is conserved, a(x) does not depend on r. Back in (9.14) we have learned that 



n ■ V z = 



n ■ p 

7t 



is an open string world-sheet constant. 



(9.16) 



In this parameterization the momentum density n ■ V T is constant so the cr value assigned 
to a point is proportional to the amount of n ■ p momentum carried by the portion of string 
from the endpoint a = 0 to the point. 



Let us now examine the equation of motion 3 r P^ + d a 'P^ = 0. Dotting this equation with 
n 1 ' we arrive at the condition 

3 3 

(n-V z ) + —{n-V°) = 0. (9.17) 

9r da 

The first term vanishes on account of (9.16), and we are left with 



— (n • V a ) = 0. 
da 



This means that n ■ P' 1 is independent of a . 



(9.18) 
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We now explain that for open strings, n ■ V n — 0. Because of equation (9.18), it suffices 
to show that n ■ V n vanishes at some point on each string. As noted below equation (9.4), 
we assume n ■ V n = 0 at the string endpoints since it guarantees the conservation of n ■ p. 
Thus, at least for open strings, 

n-V a = 0. (9.19) 



Closed strings are dealt with in a similar way. We also parameterize them making the 
momentum density n ■ V z constant and, again, we conclude that n ■ V T is a world-sheet 
constant. For closed strings we want a range a e [0, 2jr], so (9.16) is changed into 

n ■ V T = - — — (closed strings). (9.20) 

2 TC 

Given this change, it will be convenient to write the r parameterization of closed strings 
without the factor of two present in (9.12): 

n ■ X(t, a) = a'(n ■ p) x (closed strings). (9.21) 



While equation (9.18) also holds for closed strings, we cannot prove that n ■ V° — 0; 
there is no special point on a closed string where n ■ V n is known to vanish. We note, in 
addition, a related subtlety: it is not clear how to select the point a = 0 at each value of 
r. The two problems can be solved at once. Viewing the closed string world-sheet as a 
collection of closed strings (the curves of constant r), we will select cr = 0 on one string 
arbitrarily. We will then select cr = 0 on all other strings by requiring n ■ V a = 0. 

To show how this is done, we begin by using (6.50) to compute n ■ V a : 

v a _ 1 (X ■ X')d T (n ■ X) - (X) 2 d a (n ■ X) 

2ttoi' ■ X') 2 - (X) 2 (X') 2 

From (9.21) we see that d a (n ■ X) = 0, and therefore n ■ V a is 



1 (X-X')d r (n-X) 
y/(X ■ X’) 2 - (Z) 2 (X') 2 ’ 



(9.23) 



It suffices to prove that we can make n ■ V a vanish at one point on each string. Since 
3 t (n ■ X) is a constant, we must show that X ■ X 1 = 0 at some point on each string. 



Pick an arbitrary point P on a given string, and declare it to be a point for which o = 0 
(Figure 9.2). We claim that there is (up to scaling) a unique vector v 11 tangent to the world- 
sheet at P and orthogonal to the spacelike vector X 11 ' tangent to the string. This is not 
difficult to see. The world-sheet has a timelike tangent vector t 11 at P . Since X 11 ' and t j ‘ 
are not parallel, they generate the tangent space to the world-sheet at P . If t,jX IJ l = 0, then 
t ^ is the desired vector v 11 . If t^X^’ ^ 0, we can define 

= t* + bX 11 ', (9.24) 



and solve for the constant b such that v^X 11 ' = 0: 



„ „ t ■ X' 

X 11 '. 



t -X' + bX f -X' = 0 



V' ■ X' 



(9.25) 
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line <7=0 





Fixing the a = 0 line on the closed string world-sheet, after declaring P to be a point with 
or = 0. 

Quick calculation 9.2 Show that the vector v M in (9.25) is timelike. 

The point a = 0 on neighboring strings is declared to be given by X ll (P) + e v 1 ', for 
infinitesimal e. The vector v 1 ' is therefore tangent to the desired a = 0 line at P . The full 
ct= 0 line is constructed by requiring that at each point its tangent be orthogonal to X 1 ' 1 . 
Since the tangent to the a — 0 line is proportional to X 11 , we guarantee that X ■ X' — 0 
along the a — 0 line. Thus we can ensure that X ■ X ' , and consequently n ■ V a , vanishes 
at one point on each string. As a result, we have n ■ V a = 0 everywhere. Equation (9.19) 
can therefore be used both for open and for closed strings: 



n ■ V a = 0 (open and closed strings). 


(9.26) 


We have now concluded the description of the parameterizations of open and closed strings. 
The defining equations, in both cases, are summarized by 


n ■ X(t, a) = j5a' (n ■ p) r, 




2 JT 

n ■ p = — n ■ V r , 

P 


(9.27) 


where 




f 2 for open strings, 

P=\ 

1 1 for closed strings. 


(9.28) 
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Although we were able to define the cr = 0 line on the closed string world-sheet consis- 
tently, our construction has an obvious ambiguity. We had to choose one arbitrary point on 
one string. Any other point on that one string could have been used as a = 0. This means 
that the parameterization of the closed string world-sheet can be shifted rigidly along the a 
direction. There is no way to avoid this ambiguity. The gauge conditions do not fix uniquely 
the parameterization of the closed string world-sheet. This will have implications for the 
theory of closed strings. 



9.3 Constraints and wave equations 



Let us now explore the constraints on X' and X that are implied by our chosen parameter- 
ization. The vanishing of n ■ V n , together with (9.23) and the recognition that 3 z (n ■ X) is 
a nonvanishing constant, leads to 

X ■ X' = 0. (9.29) 



In the static gauge, X — 0, and (9.29) reduces to X ■ X' — 0, which we obtained before 
in (7.1). Equation (9.29) is a constraint that follows from our parameterization. 

We now use (9.29) to simplify the expression (6.49) for V T : 



■ 



I X' 2 X^ 
2TTOL 1 J_X2 X > 2 ' 



(9.30) 



With the help of this result, the second equation in (9.27) gives 



1 X' 2 (n ■ X) 
P a ' s/-X 2 X’ 2 ' 



(9.31) 



Since n ■ X — fia' (n ■ p) (see (9.27)), the factors of cancel, and we find 



X' 2 

s/-X 2 X' 2 



X 2 + X' 2 = 0, 



(9.32) 



where we used X'~ ^ 0. Aside from the units that are now different, this is consistent with 
the earlier result (7.23), which we obtained using the static gauge and involves only spatial 
components. Equations (9.29) and (9.32) are the constraint equations that follow from our 
choice of parameterization. Together they read 



X ■ X' = 0, X 2 + X' 2 = 0. 


(9.33) 


These two conditions are conveniently packaged together as 


(X ± X') 2 = 0. 


(9.34) 
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Note that the constraints take this form for any value of ft in (9.27). Our choices of ft for 
open and closed strings give convenient er ranges, but other choices are possible. 

Given the above constraints, the momentum densities V TI 1 and 'P r ‘ IL simplify considerably. 
To use the constraints to simplify (9.30), we must take the positive square root in the 
denominator. Since X' 2 > 0, using (9.32) gives 

,J-X 2 X' 2 = s/x l 2 X ' 2 = X' 2 . (9.35) 

Back in (9.30) we therefore have 

P T/i = — ' — X (9.36) 

2jta r 



The momentum density V nl ft recorded in (6.50), simplifies down to 



„„ 1 X 2 X^’ 1 X 2 X»' 

'pen — — 

2tux' J —x^x' 2 2.na' X’ 2 

and, using (9.32), 


(9.37) 


pcrfi _ 1 x n' 

2 tux’ 


(9.38) 


The momentum densities are simple derivatives of the coordinates. Using these expressions 
in the held equation d r V TI1 + d a V ai 1 = 0, we find 


x IJ - - X^" = 0. 


(9.39) 



With our parameterization, the equations of motion are just wave equations! Notice that 
the minus sign on the right-hand side of (9.38) was necessary in order to get a wave equa- 
tion. For open strings with free endpoints, the wave equations are supplemented by the 
requirement that the V n ,L , and therefore the X 1 " , vanish at the endpoints. 



9.4 Wave equation and mode expansions 



We will now explicitly solve the wave equation (9.39) in full generality for the case of open 
strings. In doing so we will introduce some of the basic notation that is used in string theory. 
We will assume that we have a space-filling D-brane. As a result, all string coordinates X ! ‘ 
satisfy free boundary conditions at the endpoints. We know that the most general X ll (r. o) 
that solves the wave equation (9.39) is 
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X*{x,a) = !(V(r + a) + ^(r-a)), (9.40) 

where f 1 ' and g 1 ' are arbitrary functions of a single argument. Bearing in mind (9.38), the 
free endpoint boundary conditions V a/l = 0 imply the Neumann boundary conditions 

dX v 

= 0, at a = 0, it. (9.41) 

da 

The boundary condition at cr = 0 gives us 

dX'- 1 1/ x 

— (r, 0) = - (>'(r) - g»'( r)j = 0. (9.42) 

Since the derivatives of f 1 ' and g ,L coincide, f 11 and g 1 ' can differ only by a constant 
c M . After replacing in (9.40), the constant c 11 can be reabsorbed into the 

definition of . The result is 

X^(r,a)= I(/^(r + a) + /' i (r-a)). (9.43) 

Now let us consider the boundary condition at a = tt. 

— (r,7r) = -(/^(r + 7r)-/^(r-7r)J = 0. (9.44) 

Since this equation must hold for all r, we learn that is periodic with period 2 jt. Since 
2tc is a natural period, our decision to parameterize the open string with a e [0, tt] has 
paid off. 

We now write the general Fourier series for the periodic function 

oo 

f^\u) = f[ J ‘ + cosnu + b'n sinnt<Y (9.45) 

n= 1 

Integrating this equation we get the expansion of / M (n): 

OO 

/^(m) = + ff 1 u + Y'(a» cos nu + Bjf sinn^Y (9.46) 

n = 1 

where we have absorbed the constants arising from integration into new coefficients. We 
substitute this expression for f(u) back into (9.43) and simplify to get 

OO 

^(r, a) = /<f + f*x + Y^{^n co&nr + Bn sin m\ cos na. (9.47) 

77=1 



We want to replace the coefficients in equation (9.47) by new coefficients that have 
a simple physical interpretation. Our first step is introducing constants a 1 /, through the 
relations 



A% cos nr + sinnr = ((£,? + iA%)e inT - (B' n l - iA^e~ inr ^ 



. V2cF 



In \ 



K v 



')■ 



(9.48) 
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Here * denotes complex conjugation. The purpose of the y/2a' factor is to make the con- 
stants a 1 ,' dimensionless. These constants, and their complex conjugates, will turn into 
annihilation and creation operators when we consider the quantum theory. Equation (9.48) 
introduces the notation that string theorists conventionally use. 

In equation (9.47) the constant has a simple physical interpretation. Using (9.36), 
the momentum density is given by 



-pT/i _ 



2 not' 



X » = 



2 rca 



7 /f + ••' 



(9.49) 



where the dots denote terms with cos no dependence (n ^ 0). To find the total momentum 
p ,L , we integrate V 11 ' over a e [ 0, n ]. Happily, the terms represented by the dots do not 
contribute as the integral of cos no vanishes. We get 

P 11 = J v^do = Tt/r — > ./f = 2<x’ P r (9.50) 

This identifies f[ L as a quantity proportional to the spacetime momentum carried by the 
string. Declaring f[‘ — x[f , and collecting all the above information, equation (9.47) now 
takes the conventional form 




The terms on the right-hand side clearly correspond to the zero mode, to the momentum, 
and to the oscillations of the string. If all the coefficients a„ of the oscillations vanish, the 
equation represents the motion of a point particle. 

Quick calculation 9.3 Verify explicitly that X^(r, o) is real. 

Let us now introduce some notation that will allow us to write simple expressions for the 
r and o derivatives of X^(r, o). We start by defining 



a(f = V2cdp /J '. 



(9.52) 



Furthermore, we define 



-Jn, a^ n = a%*y/n, n > 1 . 



(9.53) 



It is important to note that 



< = «)*• 



(9.54) 
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Moreover, while the a 1 ,', are only defined when n is a positive integer, the a„ are defined 
for any integer n, including zero. Using these new names, we can rewrite X /l as 

OO . 

X M (r, a) = Xq + V 2a' a {f r — i V 2a' ^ -(^a^ n e ,nr — a^e~ lnT J cos no. (9.55) 

n = 1 



It is convenient to sum over all integers except zero: 



X^(r, a) — x q + V2a' a£ r + iV 2a' J — ajf 



(9.56) 



njt 0 



This completes the solution of the wave equations with Neumann boundary conditions. 
In the above equation, a solution is defined once we specify the constants Xq and aj, 
for n > 0. 

It is convenient to record here the r and a derivatives of X /l . From (9.56) we see that 

X M = V2cF J2 a n cos nae~ inz , (9.57) 

neZ 

X ^ = —iV 2a' ^ sin nae~" lT , (9.58) 

neJj 

where Z denotes the set of all integers (positive, negative, and zero). Finally, two linear 
combinations of the above derivatives are particularly nice: 



X P ± X 11 ' = Vw < e~ in{T±a) . 


(9.59) 







We have found solutions of the wave equations that satisfy the relevant boundary con- 
ditions, but we must also ensure that the constraints (9.33) are satisfied. If we specify 
arbitrarily all constants aj , , the constraints will not be satisfied. We will use the light-cone 
gauge to find a solution that satisfies the wave equations as well as the constraints. 



9.5 Light-cone solution of equations of motion 



The light-cone solution of the equations of motion involves using light-cone coordinates 
to represent the motion of strings, and imposing a set of conditions that defines the light- 
cone gauge. We have seen in Chapter 2 that using light-cone coordinates means using x + 
and x~ instead of x° and x 1 - this is just a change of coordinates. Imposing a light-cone 
gauge condition is a more substantial step. The gauges we have examined in this chapter 
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represent very specific choices of world-sheet coordinates. One of these choices is the 
light-cone gauge. 



Selecting the light-cone gauge means imposing the conditions (9.27) with a vector n ^ that 
gives n ■ X = X + . Taking 



= ( 7 !' 7!' 0 ")■ 



(9.60) 



we indeed find 



n ■ X = 



X° + X l 

~7T~ 



= x+, 



n • p = 



V2 



= P + - 



(9.61) 



Using these relations in (9.27) we have 

X + (r, a) = jia'p + r, 




(9.62) 



where (3 = 2 for open strings and = 1 for closed strings. The second equation tells us 
that the density of p + is constant along the string. 



The strategy behind the light-cone gauge is to use the especially simple form of X + to 
show that there is no dynamics in X~ (up to a zero mode) and that all the dynamics is in 
the transverse coordinates X 2 , X 3 , . . ., X d . These transverse coordinates will be denoted 
by X 1 , where the transverse index I runs from 2 up to d: 

X 1 = (X 2 ,X 3 ,..., X d ). (9.63) 

In order to proceed we look at the constraint equations (9.34). Using the definition (2.59) 
of the relativistic dot product in light-cone coordinates, we can write these constraints as 

- 2(X + ± X+')(X~ ± X~) + (X 1 ± X 1 ') 2 = 0, (9.64) 

where (a 1 ) 2 = a 1 a 1 and, as usual, repeated indices imply summation. Since X + ' = 0 and 
Z + = fia' p + , we in fact have 

X~±X~'= ^-^—(X 1 ±X 1 ') 2 . (9.65) 

pa' 2 p + 

In writing the above we have assumed that p + ^ 0. While p + certainly satisfies p + > 0, 
it can happen that p + is equal to zero. For this, the momentum p 1 must cancel the energy, 
and this can only occur if we have a massless particle traveling exactly in the negative 
x l direction. Since the vanishing of p + is thus not a common occurrence, we will take 
p + to be always positive. If we come across a situation where p + is zero, the light-cone 
formalism will not apply. 

Note the crucial role played by both the choice of light-cone coordinates and the choice 
of light-cone gauge in allowing us to solve for the derivatives of X~ . Light-cone coordi- 
nates were useful because the off-diagonal metric in the (+, — ) sector allowed us to solve 
for the derivatives of X~ without having to take a square root! We just had to divide by 
X + . Here the light-cone gauge was useful, since it made X + equal to a constant. 



188 



Light-cone relativistic strings 



Equations (9.65) determine both X~ and X~ in terms of the X 1 , and therefore they 

determine X~ up to a single integration constant. All that is needed is the value of X~ at 

some point P on the world-sheet and then we can integrate the relation 

8X~ 3X~ 

dX = dr H da , (9.66) 

3r da 

to find the value of X~ at any other point Q. On an open string world-sheet we can choose 
any path from P to Q to perform the integration and the result for X~(Q) will not depend 
on the path, as discussed in Problem 9.2. On a closed string world-sheet there is a further 
consistency condition. Imagine a contour of integration that starts at P and ends at P after 
going around the world-sheet. It is not guaranteed that the integration of dX~ on this 
contour gives zero, a result necessary for X~ to be well-defined. If we choose the contour 
to be at constant r, we must require 

f 271 dx~ 

/ da = 0. (9.67) 

Jo oa 

This is a nontrivial constraint; see Problem 9.5. 

Our analysis indicates that the full evolution of the string is determined by the following 
set of objects: 

X ! (r, a), p + , Xq, (9.68) 

where x q is the constant of integration needed for X~ . 

Let us focus on the case of open strings (/l = 2). We consider the explicit solution for 
the transverse coordinates X 1 and calculate the associated X~ . Making use of the general 
solution in (9.56) we have 

X 1 (r, a) = x o + y/2a' a^x + i V 2a' ^ —a I n e~ mx cos na. (9.69) 

n# 0 n 

Moreover, for the X + coordinate the gauge condition gives 

X + (r. a) — 2a' p + r = s/lcd ajj" r. (9.70) 

As we can see, the position zero mode and the oscillations of the X + coordinate have been 
set to equal to zero: 

x+ = 0, «+ = a\ = 0, n = 1,2,..., oo. (9.71) 

What about X~1 Being a linear combination of X° and X 1 , the coordinate X~ satisfies the 
same wave equation and the same boundary conditions as all the other coordinates. We can 
therefore use the same expansion as in (9.56) to write 

X~(r, a) = Xq + V2od <XqX + iV2od ^ - a~e~ l " r 

njto n 



cos na. 



(9.72) 
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Using equation (9.59) with /x = — and p. = I, we find 



X~ ± X~' = e~ in(T±a \ 

neZj 


(9.73) 


X 1 ± X 1 ' = a n e- in(z±a \ 

neJj 


(9.74) 



We use these equations and (9.65) to solve for the minus oscillators: 

= — — V a 1 a 1 e ~'" (r±a) 

2 n+ u P u n-pC 

«,/?eZ 






— iw(r±0r) 



2p+ 



/?eZ 



(9.75) 



It follows that we can identify a n consistently as 

V2a’a~ = — ai-pftp- (9.76) 

P peZ 

This represents a complete solution! We now have explicit expressions for the minus 
oscillators a~ in terms of the transverse oscillators. On the right-hand side the spacetime 
indices are only to be summed over the labels of the transverse coordinates. 

The general solution which represents an allowed motion is fixed by specifying arbitrary 
values for p + , Xq , x^, and for all the constants a,{. This clearly determines X 1 (x, <r) in 
(9.69), and X+(x, a) in (9.70). Using (9.76) we can calculate the constants a n , which 
together with xX determine X~(x, a) in (9.72). The full solution is thus constructed. 



The quadratic combination of oscillators on the right-hand side of (9.76) is remarkably 
useful, so it has been given a name. It is the transverse Virasoro mode L^: 



= 

P 


1 \ i i 

2 I^ a n-P a p- 


(9.77) 




pel 




In particular, for n = 0 we use (9.52) and find 


V 2a' a Q = 2a' p = Lq — > 


• 2p + p~ = — Lq . 

cr 


(9.78) 



Using the value of a n given in (9.77), equations (9.73) and (9.65) are written as 
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Quick calculation 9.4 Show that 

X~(r, cr) = xX -LaT H — — -L^re~ lnT cosner. (9.80) 

P + P + n 

‘ F njH) 

This equation explicitly demonstrates the claim that the Virasoro modes are the expansion 
modes of the coordinate X~(r, cr). 



It is instructive to compute the mass of a string which is performing an arbitrary motion. 
The mass can be calculated using the relativistic equation 

M 2 = -p 1 = 2p + p~ - p'p 1 . (9.81) 



Since the mass is a constant of the motion, we anticipate that it depends on the constant 
coefficients introduced to define a classical solution. To evaluate the mass we start with 
(9.78), substitute the value of from (9.77), and use the definitions in (9.52) and (9.53): 



111 00 i oo 

2p+p ~ = ^ L 0 =^ ( 2 “ 0«0 + E “»*“») = p'p' + ^7 E' 

n = 1 n = 1 



n a!* a I . 



(9.82) 



Replacing this result on the right-hand side of (9.81) we finally find 



M 2 




n = 1 




(9.83) 



This is a very interesting result. The mass-squared is written as a sum of terms each of 
which is of the form a* a = \a\ 2 > 0. So, we find that M 2 > 0. This shows that the classical 
string mass M = V M 2 is a real number (conventionally taken to be positive). Such a result 
is actually hard to obtain without using the light-cone gauge. We also see that we can 
adjust the coefficients a[ t to obtain classical string solutions with arbitrary values of the 
mass. If all the coefficients a j t vanish, the result is a massless object M 2 = 0. Indeed, when 
all a ! n vanish the string collapses to a moving point: equation (9.69) gives X 1 (r, a) = 
Xq + y/2a' c/qT, and the a dependence disappears. 

Quick calculation 9.5 Calculate X~ (r, a) when all aj, vanish. Note that the a dependence 
of X~ disappears. 

The classical result (9.83) for M 2 does not survive quantization. First, M 2 will become 
quantized, and string states will not exhibit a continuous spectrum of masses. This is good 
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because we do not observe in nature particle states that take continuous values of the mass. 
Even more, equation (9.83) does not give enough massless states. The few massless states 
that are obtained do not behave at all like the massless states of Maxwell theory. For closed 
strings, the few massless states that are obtained by a similar analysis do not behave at 
all like the massless states of gravitation. Quantum mechanics will add an extra constant 
to the formula for M 2 , both for open and for closed strings. These additive constants will 
enable us to find states that correspond to those of physical theories. String theory has a 
chance to describe gauge fields and gravity because quantization changes (9.83) and the 
corresponding formula for closed strings. 



Problems 



Problem 9.1 Vectors orthogonal to null vectors are null or spacelike. 

Let n IL be a nonzero null vector = 0) in D -dimensional Minkowski space. In 

addition, let b 1 ' be a vector that satisfies n^ h 1 ' = 0. Prove the following. 

(a) The vector b 1 ' is either spacelike or null. 

(b) If //' is null, then b 1 ' = ),n /J for some constant /.. 

(c) The set of vectors b 11 that satisfies n^b 1 ' = 0 is a vector space V of dimension (D — 1). 
The subset of null vectors b 11 is a vector subspace of V of dimension one. 

This result shows that for gauges (9.2) with n 1 ' null and for D > 2, strings are almost 
always spacelike objects. Moreover, the hyperplane orthogonal to n 1 ' contains n 11 . This is 
readily confirmed in two dimensions: 

(d) Let D = 2 and consider a spacetime diagram such as the one in Figure 2.2. What is 
the null vector n !> such that n ■ X = X + 1 Confirm that n 1 ' points along the lines of 
constant X + . 

Problem 9.2 Consistency checks on the solution for X~ . 

(a) Use (9.65) to find d T X~ and d a X~. Show that the consistency condition d a (d T X~) = 
3 r (3 crX~) holds if the transverse coordinates X 1 satisfy the wave equation. Prove that 
this condition guarantees that X~ , determined by integration of (9.66), is independent 
of the chosen path. 

(b) Show that X~ , as calculated in (9.65), satisfies the wave equation if the transverse 
coordinates X 1 satisfy the wave equation. 

(c) Assume that at the open string endpoints some of the transverse light-cone coordinates 
X 1 satisfy Neumann boundary conditions and some satisfy Dirichlet boundary condi- 
tions. Prove that X~, as calculated in (9.65), will always satisfy Neumann boundary 
conditions. 

Problem 9.3 Rotating open string in the light-cone gauge. 

Consider string motion defined by x g — xL — 0, and the vanishing of all coefficients a ! n 
with the exception of 
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(2) (2)* 
aq = a_[ 



'■ a. 



„( 3 ) 



( 3 )* 

a_j — la. 



(1) 



Here a is a dimensionless real constant. We want to construct a solution that represents an 
open string that is rotating on the (x , x 3 ) plane. 



(a) What is the mass (or energy) of this string? 

(b) Construct the explicit functions X 2 (r, a) and X 2 (t, a). What is the length of the string 
in terms of a and a'? 

(c) Calculate the modes for all n . Use your result to construct X~(r, a). Your answer 
should be a -independent! 

(d) Determine the value of p + using the condition that for this string X 1 (r, er) = 0. Find 
the relation between t and r . 

(e) Confirm that in your solution the energy of the string and its angular frequency of 
rotation are related to its length as in (7.59). 



Problem 9.4 Generating consistent open string motion: How does an open string collapse? 

Consider string motion defined by xC = xL = 0, and the vanishing of all coefficients a 2 
with the exception of 

(2) (2)* 
a j = of j = a. 

Here a is a dimensionless real constant. We want to construct a solution that represents an 
open string oscillating on the (x 1 , x 2 ) plane and having zero momentum in this plane. 



(a) Show that the string motion is described by 
1 1 



*j2a' ci 
1 1 
\/2a' ci 
1 1 



= — X°(r, er) = V2 (r H — sin2r cos2erV 
'a v 4 / 



X l (T ,<X) = - 



1 



2V2 



sin 2r cos 2a, 



V2a' ci 



— X (x, a) = 2 sin r cos a. 



(b) Confirm that r flows as t flows. In the chosen Lorentz frame, strings are lines on the 
world-sheet that lie at constant time X°. Find the values of r for which constant r lines 
are strings. Describe those strings. 

(c) At r = 0 the string has zero length. Study in detail the motion for r 1. Calculate 
r = r(f, a) and use this result to find X l (t, a) and X 2 (t, a). Prove that as the string 
expands from zero size, it lies on the portion cos 6 > — 1/3 of a circle centered at the 
origin, whose radius grows at the speed of light (6 is measured with respect to the 
positive x 1 axis). Note that the endpoints move transversely to the string. 

(d) Use your favorite software package to do a parametric plot of the string world-sheet as 
a surface in three dimensions. Use X , X 2 . and X t} as x, y, and z axes, respectively, 
and parameters r and a. For further help visualizing the motion of the string, plot the 
string on the (x 1 , x 2 ) plane at various values of the time X (i . This requires solving 
(numerically) for r as a function of X° and a . 
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Problem 9.5 A closed string in the light-cone gauge. 

Consider a closed string for which 

X®(r, a) = J2a' (ci sin(r — er) + b cos(r — a) + a sin(r + a) + b cos(r + er)^, 

and all other transverse coordinates X 1 (r, o) vanish. In the above, a , b, a, and b are real 
constants. Note that a and b are the coefficients of waves that propagate towards increasing 
a while a and b are the coefficients of waves that propagate towards decreasing a . As usual 
for closed strings, we set X + (r, <r) = a' p + r and use a e [0, 27 t], 

(a) As it turns out, it is not possible to generate a solution of the equations of motion for 
completely arbitrary values of the constants a, b, a, and b. Examine the calculation of 
X~(r, a) and derive the constraint that the constants must satisfy. 

(b) Your constraint must allow a = b = a — b = r. Use these values to calculate 
X~(x, a) and to determine the mass of this string. 



r 

10 Light-cone fields and particles 

L 



We study the classical equations of motion for scalar fields. Maxwell fields, and 
gravitational fields. We use the light-cone gauge to find plane-wave solutions to 
their equations of motion and the number of degrees of freedom that character- 
ize them. We explain how the quantization of such classical field configurations 
gives rise to particle states - scalar particles, photons, and gravitons. In doing 
so we prepare the ground for the later identification of such states among the 
quantum states of relativistic strings. 



( 



10.1 Introduction 



In our investigation of classical string motion we had a great deal of freedom in choosing 
the coordinates on the world-sheet. This freedom was a direct consequence of the reparam- 
eterization invariance of the action, and we exploited it to simplify the equations of motion 
tremendously. Reparameterization invariance is an example of a gauge invariance, and a 
choice of parameterization is an example of a choice of gauge. We saw that the light-cone 
gauge - a particular parameterization in which r is related to the light-cone time X + and a 
is chosen so that the /t + -density is constant - was useful to obtain a complete and explicit 
solution of the equations of motion. 

Classical field theories sometimes have gauge invariances. Classical electrodynamics, for 
example, is described in terms of gauge potentials A^. The gauge invariance of this 
description is often used to great advantage. The classical theory of a scalar field is sim- 
pler than classical electromagnetism. This theory is not studied at the undergraduate level, 
however, because elementary scalar particles - the kind of particles associated with the 
quantum theory of scalar fields - have not been detected yet. On the other hand, photons - 
the particles associated with the quantum theory of the electromagnetic field - are found 
everywhere! Scalar particles may play an important role in the Standard Model of particle 
physics, where they can help trigger symmetry breaking. Thus physicists may detect scalar 
particles some time in the future. The field theory of a single scalar field has no gauge 
invariance. We will study it because it is the simplest field theory and because scalar parti- 
cles arise in string theory. The most famous scalar particle in string theory is the tachyon. 
Also important is the dilaton, a massless scalar particle. 

Einstein’s classical field theory of gravitation is more complicated than classical elec- 
tromagnetism. In gravity, as we explained in Section 3.6, the dynamical variable is the 
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two-index metric field g fl v (x ) . Gravitation has a very large gauge invariance. The gauge 
transformations involve reparameterizations of spacetime. 

We will consider scalar fields, electromagnetic fields, and gravitational fields. The light- 
cone gauge will allow us to simplify dramatically the (linearized) equations of motion, find 
their plane-wave solutions, and count the number of degrees of freedom that characterize 
the solutions. We will also briefly consider how the quantization of plane-wave solutions 
gives rise to particle states. These are the quantum states associated with the field theories. 
We quantize the relativistic string in Chapter 12. There we relate the quantum states of the 
string to the particle states of the field theories that we study in the present chapter. We use 
the light-cone gauge here because we will quantize the relativistic string in the light-cone 
gauge. 



A scalar field is simply a single real function of spacetime. It is written as (p{t, x) or, more 
briefly, as <p(x). The term scalar means scalar under Lorentz transformations: all Lorentz 
observers will agree on the value of the scalar field at any fixed point in spacetime. Scalar 
fields have no Lorentz indices. 

Let us now motivate the simplest kind of action principle that can be used to define the 
dynamics of a scalar field. Consider first the kinetic energy. In mechanics, the kinetic 
energy of a particle is proportional to its velocity squared. For a scalar field, the kinetic 
energy density T is declared to be proportional to the square of the rate of change of the 
field with time: 



We speak of densities because, at any fixed time, T is a function of position. The total 
kinetic energy will be the integral of the density T over space. 

Now consider the potential energy density. There is one class of term that is natural. 
Suppose the equilibrium value of the field is (p = 0. For a simple harmonic oscillator with 
equilibrium position x = 0, the potential energy goes like V ~ x 2 . If we want the held 
to prefer its equilibrium state, then this must be encoded in the potential. The simplest 
potential which does this is quadratic: 



It is interesting to note that the constant m introduced here has the units of mass. Indeed, 
since the expressions on the right-hand side of the two equations above must have the same 
units ([7’] = [V]), and both have two factors of <p, we require [m] = [3o] = /A 1 = M. 

We could now attempt to form a Lagrangian density by combining the two energies above 
and setting 
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T = Oo <P) 2 - 



( 10 . 1 ) 




( 10 . 2 ) 




(10.3) 
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This Lagrangian density, however, is not Lorentz invariant. The second term on the right- 
hand side is a scalar, but the first term is not, for it treats time as special. We are missing 
a contribution representing the energy cost when the scalar field varies in space. This is 
eminently reasonable in special relativity: if it costs energy to have the field vary in time, 
it must also cost energy to have the field vary in space. The extra contribution is therefore 
associated with spatial derivatives of the scalar field, and can be written as 

V' = \ = ^W) 2 - ( !0- 4 ) 

i 

where 3/ are derivatives with respect to spatial coordinates. We have written this term as 
a new contribution V' to the potential energy, rather than as a contribution to the kinetic 
energy. There are several reasons for this. First, it is needed for Lorentz invariance. The 
two options lead to contributions of opposite signs in the Lagrangian, and only one sign 
is consistent with Lorentz invariance. Second, kinetic energy is always associated with 
time derivatives. Third, the calculation of the total energy vindicates the correctness of the 
choice. Indeed, with this additional term the Lagrangian density becomes 

c = T - V' - V = - 3o0 d 0 4> - - d,4> 3 ,-</> - - m 2 </) 2 , (10.5) 

where the repeated spatial index i denotes summation. The relative sign between the first 
two terms on the right-hand side allows us to rewrite them as a single term which uses the 
Minkowski metric rj llv : 

£ = -^A^3i-^V. (10.6) 

Since all the indices are matched, the Lagrangian density is Lorentz invariant. The 
associated action is 

5 = n ,lV d^dv<P - \m 2 ct > 2 ), (10.7) 

where d D x — dx°dx 1 . . . dx d , and D — d + 1, is the number of spacetime dimensions. 
This is the action for a free scalar field with mass m . A field is said to be free when its 
equations of motion are linear. If each term in the action is quadratic in the field, as is the 
case in (10.7), the equations of motion will be linear in the field. A field that is not free is 
said to be interacting, in which case the action contains terms of order three or higher in 
the field. 



To find the energy density in this field we calculate the Hamiltonian density TL. The 
momentum n conjugate to the field is given by 



n = 



3 C 



3(3o0) 



( 10 . 8 ) 



where we used (10.5) to evaluate the derivative. The Hamiltonian density is then con- 
structed as 



n = n3 0 </> - c. 



(10.9) 



197 



10.3 Classical plane-wave solutions 



Quick calculation 10.1 Show that the Hamiltonian density takes the form 

H= \ n 2 +\^ ( l>j 2 + \m 2 (t> 2 - ( 10 . 10 ) 

The three terms in 7i are identified as T, V', and V, respectively. This is what we expected 
physically for the energy density. The total energy E is given by the Hamiltonian H, which 
in turn, is the spatial integral of the Hamiltonian density TL~. 

E = H = J d d x(^d Q 4>do<p+^(V<p) 2 + ^m 2 <p 2 y (10.11) 

To find the equations of motion from the action (10.7), we consider a variation Sep of 
the field and set the variation of the action equal to zero. After discarding a total derivative 
we find 

8S = J d D x(-r j ' iv d fl (8(l))d v <p - i n 2 <p8$J = J d D x Scp^d^ip - m 2 &j = 0 . 

( 10 . 12 ) 

The equation of motion for <p is therefore 

?7 /iV 3 Al 3 v 0-m 2 0 = O. (10.13) 

If we define 3 2 = f7 A ‘ v 3 |tt 3 u , then we have 

(3 2 — m 2 ) (f> = 0 . (10.14) 

Separating out time and space derivatives, this equation is recognized as the Klein-Gordon 
equation: 

3 2 (p 7 ~ 

y + V 2 </>-/« 2 0 = O. (10.15) 

3 t z 

We will now study some classical solutions of this equation. 



10.3 Classical plane-wave solutions 



We can find plane-wave solutions to the classical scalar field equation (10.15). Consider, 
for example, the expression 

</>(t,x) = ae~ iEt+i P- i , (10.16) 

where a and E are constants and p is an arbitrary vector. The field equation (10.15) fixes 
the possible values of E in terms of p and m: 



E 2 - p 2 -m 2 = 0 



E = ±E p , E p = 



p 2 + m 2 . 



(10.17) 
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The square root is chosen to be positive, so E p > 0. There is a small problem with the 
solution in (10.16). While 0 is a real held, the solution (10.16) is not real. To make it real, 
we just add to it its complex conjugate: 

4>(t, x) = a e - iE pt+iH + a * e iEpt—ip x _ (10 . 18 ) 

This solution depends on the complex number a. A general solution to the equation of 
motion (10.15) is obtained by superimposing solutions, such as those above, for all values 
of p. Since p can be varied continuously, the general superposition is actually an integral. 
The classical held does not have a simple quantum mechanical interpretation. If the two 
terms above were to be thought of as wavefunctions, the hrst would represent the wavefunc- 
tion of a particle with momentum p and positive energy E p . The second would represent 
the wavefunction of a particle with momentum (— p) and negative energy (— E p ). This is 
not acceptable. To do quantum mechanics with a classical held one must quantize the held. 
The result is particle states with positive energy, as we will discuss briefly in the following 
section. 



An analysis of the classical held equation (in a practical way that applies elsewhere) uses 
the Fourier transformation of the scalar held <p(x): 



(p(x) = 



J 



d D p 

(2 7t) D 



e ipx H P ). 



(10.19) 



Here <p{p) is the Fourier transform of <fi (x ) . We will always show the argument of (j) so 
no confusion should arise between the spacetime held and the momentum-space held. 
Note that we are performing the Fourier transform over all spacetime coordinates, time 
included: p ■ x — —p°x° + p ■ x. The reality of <p(x) means that <fi(x) = (0(x))*. Using 
equation (10.19), this condition yields 



/ 



d D p 



e' p x 4>(p) = 



f 



d D 



P - 






ip-x 



(<P(p))* 



( 10 . 20 ) 



We let p — > — p on the left-hand side of this equation. This change of integration variable 
does not affect the integration / d D p, and results in 

C 

J e ~ iP ' X (<K-F) - (0(F))*) = 0 , (10.21) 

where we collected all terms on the left-hand side. This left-hand side is a function of x that 
must vanish identically. It is also the Fourier transform of the momentum-space function 
in parentheses. This function must therefore vanish: 



(<P(p))* = H-p) ■ 



(10.22) 



This is the reality condition in momentum space. 

Substituting (10.19) into (10.14) and letting 3 2 act on e lp x we find 



(10.23) 
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The mass-shell hyperboloid £ 2 - p 2 = m 2 (only two spatial directions shown). The equation 
of motion sets the mass m scalar field <f>(p) equal to zero away from the hyperboloid. On 
the hyperboloid the field is arbitrary up to a reality condition that relates the field values 
at antipodal points. 



Since this must hold for all values of x , this equation requires 

(p 2 + m 1 ) <p(p) = 0 for all p . (10.24) 

This is a simple equation: (p 2 + m 2 ) is just a number multiplying <j)( p) and the product 
must vanish. Solving this equation means specifying the values of 4>{p) for all values of p. 
Since either factor may vanish we must consider two cases. 

(i) p 2 + m 2 f 0. In this case the scalar field vanishes: f ( p ) = 0. 

(ii) p 2 + m 2 = 0. In this case the scalar field 0 ( p ) is arbitrary. 

In momentum space the hypersurface p 2 + m 2 = 0 is called the mass-shell. With p 11 = 
( E , p), the mass-shell is the locus of points in momentum space where E 2 = p 2 + m 2 , 
the hyperboloid sketched in Figure 10.1. The mass-shell is therefore the set of points 
(± E p , p), for all values of p. We have learned that 0( p) vanishes off the mass-shell and 
is arbitrary (up to the reality condition) on the mass-shell. 

We now introduce the idea of classical degrees of freedom. For a point p 11 on the mass- 
shell, the solution is determined by specifying the complex number (p(p). This number 
determines as well the solution at the point (—p 1 *), also belonging to the mass-shell: 
(p(—p) = Uj)( p))* . So, a complex number fixes the values of the field for two points on 
the mass-shell. We need, on average, one real number for each point on the mass-shell. We 
will say that a field satisfying equation (10.24) represents one degree of freedom per point 
on the mass-shell. 

We conclude this section by writing the scalar field equation of motion in light-cone 
coordinates. Let xj denote a vector whose components are the transverse coordinates x 1 : 

XT — (X 2 , X 3 , . . ., X d ) . 



(10.25) 
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In this notation, the collection of spacetime coordinates becomes (x + , x , xj). Equation 
(10.14) expanded in light-cone coordinates is 

/ 3 3 33 7 \ , _ _ 

+ )*° t •*~'- ,r) = 0 ' (10 ' 26) 
To simplify this equation, we Fourier transform the spatial dependence of the field, chang- 
ing x~ into p + and x 1 into p 1 . Letting pj denote the vector whose components are the 
transverse momenta p 1 

p T = (p 2 ,p\...,p d ), (10.27) 

the Fourier transform is written as 

<t>(x + , X -,x T ) = J^-J d {2 J -2 e- ix ~P ++rxT P T Hx + ,P + ,p T ). (10.28) 

We now substitute this form of the scalar field into (10.26) to get 

(-2 ^(~iP + ) ~ - '« 2 ) 0(* + , P + , PT ) = 0, (10.29) 

and dividing by 2 p + we find 

(* - Y^+(p' P 1 +'” 2 )) F + ’ Pt) = 0. (10.30) 

This is the equation we were after. As opposed to the original Lorentz covariant equation 
of motion, which has second-order derivatives with respect to time, the light-cone equation 
is a first-order differential equation in light-cone time. Equation (10.30) has the formal 
structure of a Schrodinger equation, which is also first order in time. This fact will prove 
useful when we study how the quantum point particle is related to the scalar field. 

Another version of equation (10.30) will be needed in our later work. Using a new time 
parameter r related to x + as x + — p + r/m 2 we obtain 

(i ^ - ^(P 1 P 1 + m 2 i) (p(r, p + , p T ) = 0. (10.31) 



Quick calculation 10.2 Consider the mass-shell condition p 2 + m 2 — 0 in light-cone 
coordinates. Show that 



P 



1 

2 P+ 



(p 1 p 1 + m 2 ). 



(10.32) 



10.4 Quantum scalar fields and particle states 



Quantum field theory is a natural language to describe the quantum behavior of elementary 
particles and their interactions. Quantum field theory is quantum mechanics applied to clas- 
sical fields. In quantum mechanics, classical dynamical variables turn into operators. The 
position and momentum of a classical particle, for example, turn into position and momen- 
tum operators. If our dynamical variables are classical fields, the quantum operators will 
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be field operators. Thus, in quantum field theory, the fields are operators. The state space 
in a quantum held theory is typically described using a set of particle states. Quantum held 
theory also has energy and momentum operators. When they act on a particle state, these 
operators give the energy and momentum of the particles described by the state. 

In this section, we examine briehy how the features just described arise concretely. 
Inspired by the plane-wave solution (10.18), which describes a superposition of complex 
waves with momenta p and —p (with p 0), we consider a classical held configuration 
<p p (t , x) of the form 



There have been two changes. First, the time dependence has been made more general by 
introducing the function a(t ) and its complex conjugate a*(t). The function a(t) is the 
dynamical variable that determines the held conhguration. Second, we have placed a nor- 
malization factor yfv, where V is assumed to be the volume of space. The normalization 
factor also includes a square root of the energy E p dehned in equation (10.17). 

We can imagine space as a box with sides L\, L 2 , . . Lj, in which case V = 
L\L 2 ■ ■ . Ld- When we put a held on a box we usually require it to be periodic. The held 
<p p is periodic if each component p, of p satishes 



Here the n ,■ are integers. Each component of the momentum is quantized. 

We now try to do quantum mechanics with the held conhguration (10.33). To this end, 
we evaluate the scalar held action (10.7) for </> = fi p (t , x): 



The evaluation involves squaring the held, squaring its time derivative, and squaring its 
gradient. In squaring any of these, we obtain from the cross multiplication two types of 
terms: those with spatial dependence exp(±2 / p ■ x ) and those without spatial dependence. 



these terms cannot contribute. Indeed, the quantization conditions in (10.34) imply that 



For the terms without spatial dependence, the spatial integral gives a factor of the volume 
V , which cancels with the product of \[V factors we introduced in (10.33). The result is 




(10.33) 



Pi L, = Inrii , i = 1, 2, . . ., d . 



(10.34) 




(10.35) 



We claim that the spatial integral f d d x of the terms with spatial dependence is zero, so 




(10.36) 




(10.37) 



Quick calculation 10.3 Verify that equation (10.37) is correct. 



Similarly, we use equation (10.1 1) to evaluate the held energy H: 




(10.38) 
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Quick calculation 10.4 Verify that equation (10.38) is correct. 

The action (10.37) describes the dynamics of two simple harmonic oscillators. Since a(t) 
is a complex dynamical variable we can write 



a(t) = qi(t) + i q 2 (t), (10.39) 

where q\ ( t ) and qi{t) are real coordinates. In terms of q\ ( t ) and qiit). the action becomes 

2 

S = J dtL = ]rJ dt(^~ qf (t ) - ' 2 E p qf(t )) . (10.40) 



We see here that q\(t) and < 72 ( 1 ) are indeed the coordinates of identical simple harmonic 
oscillators. Their canonical momenta are 

dL q;(t) 1 

Pi(t) = — = ^ — > Pl (t) + ip2(t) = — — a(t) . (10.41) 

oqi E p E p 

The equations of motion that follow from the variation of the action (10.40) are 

qi(t) = ~E~ p qi(t) , i = 1,2. (10.42) 

It is actually convenient to work with the complex combination a(t). Using (10.39) the 
equations of motion become the single complex equation 

a(t) = ~E 2 p a(t) . (10.43) 

Equation (10.43) is readily solved in terms of exponentials. Since it is a second-order 
equation, there are two solutions: 

a(t) = a p e~ iE p' + a*_ p e iE r’ . (10.44) 



There is no reality condition since a(t) is complex. In writing the above solution we 
introduced two independent complex constants a p and a*_ p . Substituting this solution into 
(10.38), we find 



H = 



\ a p a p 









(10.45) 



Note that the time dependence disappeared; the energy is conserved. 

In classical scalar field theory there is an integral expression that gives the spacetime 
momentum P carried by the field. In the present case, this field momentum is conserved, 
and a formula can be obtained from the analysis of Problem 8.10. We will not discuss this 
here, but the answer is 

P = - j d d x{d 0 (j))S/<p . (10.46) 



In Problem 10.1 you are asked to evaluate P for the field configuration (10.33), when a(t) 
is given by (10.44). The result that you will find is 





(10.47) 
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The system at hand consists of two harmonic oscillators. The expression for H suggests 
that in the quantum theory a p and a- p become annihilation operators, while a* and a*_ p 
become the creation operators a p and a'_ p , respectively. We will verify the correctness of 
this assumption shortly. These oscillators are required to satisfy the standard commutation 
relations: 

[a p ,al]=l, [a- p ,al p ]=l. (10.48) 

Any commutator which involves an operator with subscript p and an operator with sub- 
script (— p) is declared to vanish. The variable a(t) in (10.44) now becomes an operator. 
We record its form, together with that for its Hermitian conjugate a (t), as well as those 
for their time derivatives: 



a(t) = a p e ,Ept + aL p e ,Ept , 

a'it) = a p e lEpt + a- p e~ ,Ept , 

a(t ) = — iE p {a p e~ ,Ept — a^_ p e lEpt ), 

a^(t) — iE p [a' p e iEpt -a- p e~ iEpt ). (10.49) 

It takes a short computation to verify that the only nonvanishing commutators among 
a(t), fl^(f), a(t), and a^(t) are 




aHt) 



[a f (0 , a(o] 



= 2/ E „ 



(10.50) 



We can now show that these commutation relations imply the commutation relations that 
we would naturally impose between the coordinates and their corresponding conjugate 
momenta: 



[q l(t), pi(t)] = Pi(t)] = i . (10.51) 



To do this, we can use equations (10.39) and (10.41) to solve for the coordinates and the 
momenta: 

qi(t) = Ua(t) + a\t)) , Pl (f) = (a(t) + d f (t)) , (10.52) 

2 2 E p 

©(0=2(a(f)-flt ( o )l (10.53) 

2 1 2 iE p 

We can now check that, for example, 



[<7i(t), pi(t)] 



1 

4 T p 



|^a(t) + a 1 (t ) , a(t) +a^(t) 



1 

4Ep 



(2 iE p + 2 iE p ) — i , 



(10.54) 

as expected. The other commutators can be checked similarly, thus confirming that the 
postulated commutation relations (10.48) are correct. 



Quick calculation 10.5 Check that [q 2 (t), P 2 (t) ] — i and that [qi(t), P 2 (t) ] = 0. 



At the quantum level, the Hamiltonian (10.45) becomes the operator 

H = E p (a ] p a p + aL p a-p \ , 



(10.55) 
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describing a pair of simple harmonic oscillators, each of which has frequency E p . The 
momentum (10.46) becomes the operator 

P = p (al p a p - aLpd-p'j . (10.56) 



Note that the oscillators with subscripts (— p) contribute with a negative sign to the 
momentum. This equation will help our interpretation below. 

The first two equations in (10.49) can be used back in (10.33) to obtain the operator 
version of the field configuration: 



<j> p (t,x) = 



1 1 

VV j2Ep • 
1 1 
Vv T p 



\a„e p 



-iEpt+ip-x i t 



+ a' p e‘ 



iEpt-ip-x^j 



(a- p e~ iE P t ~ i P * + alp e iE p ,+i P*\ . 



(10.57) 



We see that <p p is in fact a spacetime dependent operator, or a field operator. The second 
line in (10.57) is obtained from the first line by the replacement p — > —p, which does not 
affect E p . In full generality, the quantum field </>(x) includes contributions from all values 
of the spatial momentum p: 



* UJ) = 7v^7k' 



t e -iE p t+ip x e iEpt—ip-x 



)■ 



(10.58) 



The commutation relations for the oscillators are the natural generalization of those in 
(10.48): 



[a p , a\ ] = S Pi k , [a p , ak] = W p , a\] = 0. (10.59) 



All subscripts here are spatial vectors, written without the arrows to avoid cluttering the 
equations. The Rronecker delta S p j. is zero unless p — k, in which case it equals one. Once 
we consider contributions from all values of the momenta, the previous expression for the 
Hamiltonian in (10.55) and that for the momentum operator in (10.56) must be changed. 
One can show that 

H = Ep a^ap , 

P 

p = T,p a l a p ■ 

p 

We will not derive these expressions, but they should seem quite plausible 

The state space of this quantum system is built in the same way as the state space of the 
simple harmonic oscillator. We assume the existence of a vacuum state | Q ) , which acts just 



(10.60) 

(10.61) 
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as the simple harmonic oscillator ground state |0> in that it is annihilated by the annihilation 
operators a p : a p \Q,) = 0 for all p. It follows that Ii\Q) =0, which makes the vacuum a 
zero-energy state. This vacuum state is interpreted as a state in which there are no particles. 
On the other hand, the state 

a\ | £2) (10.62) 

is interpreted as a state with precisely one particle. We claim that it is a particle with 
momentum p. To verify this, we act on the state with the momentum operator (10.61) and 
use (10.59) to find 

Pal | £2} = > a p] I £2) = pal |£2) . (10.63) 

k 

The energy of the state is similarly computed by acting on the state with the Hamilto- 
nian H: 

Hal l n > = J2 Ek a l [ak ’ a P ] \n) = E p al\£2). (10.64) 

k 

The state aj, | £2) has positive energy. While the quantum field operator has both positive and 
negative energy components, the states that represent the particles have positive energy. 
The states a p |£2) are the one-particle states. 

The state space contains multiparticle states as well. These are states built by acting on 
the vacuum with a collection of creation operators: 

4 1 fl «--- fl ai£2>- ao.65) 

This state, with k creation operators acting on the vacuum, represents a state with k par- 
ticles. The particles have momenta p\ , p 2 , ■ ■ pk and energies E pi , E P2 , . . ., E pk . The 
various momenta p, need not be all different. 

Quick calculation 10.6 Show that the eigenvalues of P and H acting on (10.65) are 
E*=i Pn and EL i E P „, respectively. 

Quick calculation 10.7 Convince yourself that (V = E/l a J’ a p ' s a number operator: 
acting on a state it gives us the number of particles contained in the state. 

Our analysis of classical solutions in the previous section led to the conclusion that there 
is one degree of freedom per point on the mass-shell. At the quantum level, we focus 
on the one-particle states. Consequently, we restrict ourselves to the physical part of the 
mass-shell, the part where the energy is positive (p° = E > 0). We have a single one- 
particle state for each point on the physical mass-shell. This state is labeled by its spatial 
momentum p. 

To describe the particle states in light-cone coordinates, the changes are minimal. The 
physical part of the mass-shell is parameterized by the transverse momenta pj and the 
light-cone momenta p + for which /; + > 0. The value of the light-cone energy p~ is then 
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fixed. Thus, instead of labeling the oscillators with p, we simply label them with p + and 
pr . The one-particle states are written as 

One-particle states of a scalar field: a p+ PT l^>- (10.66) 



The momentum operator given in (10.61) has a natural light-cone version. The various 
components of the operator take the form 

P + = P + a l+,p T a p + ,PT’ 

P + ,PT 

P 1 = J2 P 1 a l+, PT a P + . PT’ (m67 > 

P + ,PT 

P~ = ^r( pIpI +m2 ) a l+. pr a P + -PT- 

P + ,PT 

In the last equation, the factor which multiplies the oscillators is the value of p~ determined 
from the mass-shell condition (10.32). This last equation is analogous to (10.60), where E p 
is the energy determined from the mass-shell condition. 



10.5 Maxwell fields and photon states 



We now turn to an analysis of Maxwell fields and their corresponding quantum states. As 
opposed to the case of the scalar field, where there is no gauge invariance, electromagnetic 
fields have a gauge invariance that will make our analysis more subtle and interesting. In 
order to study the field equations in a convenient way, we will impose the gauge condition 
that defines the light-cone gauge. We will then be able to describe the quantum states of 
the Maxwell field. 

The field equations for electromagnetism are written in terms of the electromagnetic 
vector potential Ap (x). As we reviewed in Section 3.3, the field strength F /lv = dpA v — 
3„ Ap is invariant under the gauge transformation 

SAp = dpt , ( 10 . 68 ) 

where e is the gauge parameter. The field equations take the form 

3 l; F I£V = 0 — * 3 v (3'M y - 3 y A^) = 0, (10.69) 

and can be written as 

3 2 A^ — 3^(3 • A) = 0. (10.70) 

Compare this equation with equation (10.14) for the scalar field. There is no indication 
of a mass term for the Maxwell field - such a term would be recognized as one without 
spacetime derivatives. We will confirm below that the Maxwell field is, indeed, massless. 
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We Fourier transform all the components of the vector potential in order to determine 
the equation of motion in momentum space: 

A ^ X) = f ~^ e ‘ PXA ^ P) ’ (10 ' 71) 

where reality of A M (x) implies A^(—p) = {A^(p))*. Substituting (10.71) into (10.70) we 
obtain the equation: 

p 2 A^ — p^(p • A) = 0 . (10.72) 

The gauge transformation (10.68) can also be Fourier transformed. In momentum space, 
the gauge transformation relates SA^ip) to the Fourier transform e(p) of the gauge 
parameter: 

SA^(p) = ip^eip) . (10.73) 

Since the gauge parameter e(x) is real, we have e(—p) = e*(p). The gauge transformation 

(10.73) is consistent with the reality of S A^(x). Indeed, 

(SA^p))* = -ip„(e(p)f = K-zvM-p) = SA^-p) . (10.74) 

Note the role of the factor of i in getting the signs to work out. 

Being done with preliminaries, we can analyze (10.72) subject to the gauge transformations 

(10.73) . At this point, it is more convenient to work with the light-cone components of the 
gauge field: 

A+{p), A~(p ) , A 1 {p). (10.75) 

The gauge transformations (10.73) then read 

8A + = ip + e, 8A~ = ip~e, 8A I —ip I e. (10.76) 

We now impose a gauge condition. As we have emphasized before, when working with the 
light-cone formalism we always assume p + ^ 0. The above gauge transformations now 
make it clear that we can set A + to zero by choosing e correctly. Indeed, if we apply a 
gauge transformation 

A+ -* A ,+ = A+ + ip + e, (10.77) 

then the + component of the new gauge field A! vanishes if we choose e = iA + /p + . In 
other words, we can always make the + component of the Maxwell field zero by applying 
a gauge transformation. This will be our defining condition for the light-cone gauge in 
Maxwell theory: 



light-cone gauge condition : A + (p ) = 0. (10.78) 



Setting A + to zero determines the gauge parameter e, and no additional gauge transfor- 
mations are possible: if A + = 0, any further gauge transformation will make it nonzero. 
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There is one minor exception: any parameter of the form e(p) — e(p~ , p ! )S(p + ) will 
keep A + — 0 because p + e(p) = 0. The similarities with light-cone gauge string theory 
are noteworthy. In light-cone gauge open string theory all world-sheet reparameteriza- 
tion invariances are fixed. Moreover, while not equal to zero, X + is very simple: the 
corresponding zero mode and oscillators vanish. 

The gauge condition (10.78) simplifies the equation of motion (10.72) considerably. Taking 
/x = + we find 

p + (p ■ A) = 0 — > p ■ A = 0. (10.79) 

This equation can be expanded out using light-cone indices: 

- p + A~ - p~A + + p 1 A 1 = 0 . (10.80) 

Since A + = 0, this equation determines A~ in terms of the transverse A 1 : 

A~ = — r ip 1 A 1 ) . (10.81) 

P + 

This is reminiscent of our light-cone analysis of the string, where X~ was solved for in 
terms of the transverse coordinates (and a zero mode). Using (10.79) back in (10.72), all 
that remains from the field equation is 

p 2 A tl (p) = 0. (10.82) 

For p = + this equation is trivially satisfied, since A + = 0. For p = I we get a set of 
nontrivial conditions: 

p 2 A I (p) = 0. (10.83) 

For p — — , we get p 2 A~(p) — 0. This is automatically satisfied on account of (10.81) and 
(10.83). 

For each value of I, equation (10.83) takes the form of the equation of motion for a 
massless scalar. Thus, A 1 (p) = 0 when p 2 ^ 0. This makes A~ — 0, and since A + is zero, 
the full gauge field vanishes. For p 2 — 0, the A 1 (p) are unconstrained, and the A~(p) are 
determined as a function of the A ! (see (10.81)). The degrees of freedom of the Maxwell 
field are thus carried by the ( D — 2) transverse fields A 1 (p), for p 2 — 0. We say that we 
have (D — 2) degrees of freedom per point on the mass-shell. 

It is actually possible to show that there are no degrees of freedom for p 2 ^ 0, without 
having to make a choice of gauge. Although not every field is zero, every field is gauge 
equivalent to the zero field when p 2 ^ 0. If a field differs from the zero field by only a 
gauge transformation, we say that the field is pure gauge. Recall that fields A and A' are 
gauge equivalent if A^ — A'^ + 3^ x , for some scalar function / . Taking A' fl = 0, we learn 
that A^ — is gauge equivalent to the zero field, and is therefore pure gauge. The term 
pure gauge is suitable: A lt takes the form of a gauge transformation. In momentum space, 
a pure gauge is a field that can be written as 



pure gauge: A^p) = ip^xip) , 



( 10 . 84 ) 
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for some choice of / . Rewrite now the equation of motion (10.72) as 



Since p 2 ^ 0, we can write 



P 2a ih — Pn(P ' A ) ■ 




(10.85) 

( 10 . 86 ) 



Comparing with (10.84), we see that ,4 /; is pure gauge. This means that there are no degrees 
of freedom in the Maxwell field when p 2 ^ 0. For all intents and purposes, there is no field. 



Let us now discuss briefly photon states. Each of the independent classical fields A 1 can 
be expanded as we did for the scalar field in (10.58). To do so, we introduce - as you 
can infer by analogy - oscillators a^ and a 1 ^ , where the subscripts p represent the values 
of p + and pr ■ We thus get ( I) — 2) species of oscillators. Introducing a vacuum £2), the 
one-photon states would be written as 

a’ p \' PT |S2>. (10.87) 

Here the label I is a polarization label. The photon state (10.87) is said to be polarized in 
the 7th direction. Since we have (D — 2) possible polarizations, we have ( D — 2) linearly 
independent one-photon states for each point on the physical sector of the mass-shell. 
A general one-photon state with momentum ( p + , pj) is a linear superposition of the above 
states: 



D - 1 

one-photon states: £& a J+, w i n >- (10 - 88) 

1=2 



Here the transverse vector f/ is called the polarization vector. 

In four-dimensional spacetime. Maxwell theory gives rise to D — 2 — 2 single-photon 
states for any fixed spatial momentum. This is indeed familiar to you, at least classically. 
An electromagnetic plane wave which propagates in a fixed direction and has some fixed 
wavelength (i.e., fixed momentum), can be written as a superposition of two plane waves 
that represent independent polarization states. 



10.6 Gravitational fields and graviton states 



Gravitation emerges in string theory in the language of Einstein’s theory of general relativ- 
ity. We discussed this language briefly in Section 3.6. The dynamical field variable is the 
spacetime metric g !lv (x), which in the approximation of weak gravitational fields can be 
taken to be of the form g^ v {x) = p^v + h^ v (x). Both g and are symmetric under the 
exchange of their indices. The field equations for g^ v - Einstein’s equations - can be used 
to derive a linearized equation of motion for the fluctuations h jlv . This equation was given 
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in (3.82). Defining h^ v (p) to be the Fourier transform of li /lv (x), the momentum-space 
version of this equation is 

S^ip) = - pAp^h™ + p v h^ a ) + p»p v h=0. (10.89) 

If we were considering Einstein’s equations in the presence of sources, the right-hand 
side of this equation would include terms which represent the energy-momentum ten- 
sor of the sources. In the above equation h — r] llv h lu , = h 1 ^, and indices on h jLV can be 
raised or lowered using the Minkowski metric r]^ v and its inverse rj 111 ’ . Since every term in 

(10.89) contains two derivatives, this suggests that the fluctuations h^ v are associated with 
massless excitations. 

As we will see shortly, the equation of motion (10.89) is invariant under the gauge 
transformations discussed in Section 3.6: 

Soh^’ip) = ip»e v (p) + ip v ^{p ) . (10.90) 

The infinitesimal gauge parameter e^ip) is a vector. In gravitation, the gauge invariance 
is reparameterization invariance: the choice of coordinate system used to parameterize 
spacetime does not affect the physics. 

Let us verify that (10.89) is invariant under the gauge transformation (10.90). First we 
compute 8 0 h and find that 

S 0 h = ri llv 8 0 h llv = ir] /lv (p ll e v + p v e M ) = 2 ip ■ e . (10.91) 

The resulting variation in S 1 ' 1 ' is therefore given by 

8 0 S llv = ip 2 (p»e v + p v en - i PaP A P v e a + p a e v ) 

- ip a p v {p^e a + p a e 11 ) + 2/p» • e. (10.92) 

But we can rewrite 

8 0 S» V = ip 2 (p^e v + p v e») - ip^pAp ■ €) - *>VV 

- ip< J ‘p v (p ■ e) - ip 2 p v e » + 2 ip* p v p ■ e. (10.93) 

It is readily seen that all the terms in (10.93) cancel, so 8oS ,lv = 0. The equation of motion 
exhibits the claimed gauge invariance. 

Since the metric h llv is symmetric and has two indices, each running over (+, — , /), we 
have the following objects to consider: 

(h IJ , h +I , h~’, h + ~, h ++ , h ~~) . (10.94) 

We shall try to set to zero all the fields in (10.94) that contain a + index. For this, we use 

(10.90) to examine their gauge transformations: 

8 0 h ++ = 2 ip + € + , 

8oh + ~ = ip + e~ + ip~e + , 

8oh +I = ip + e ! + ip 1 e + . 



(10.95) 

(10.96) 

(10.97) 
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As before, we assume p + ^ 0. From (10.95), we see that a judicious choice of e + will 
permit us to gauge away h ++ . This fixes our choice of e + . Looking at equation (10.96), 
we see that although we have fixed e + , we can still find an e~ that will gauge away /i" 1 . 
This fixes e~ . Similarly, we can use (10.97) and a suitable choice of e 1 to set h +I to zero. 
We have used the full gauge freedom to set to zero all of the entries in li 1 '" with + indices. 
This defines the light-cone gauge for the gravity field: 



light-cone gauge conditions: h ++ — h + = h +l — 0. (10.98) 



The remaining degrees of freedom are carried by 

(h IJ , /? _/ , h ) . (10.99) 

We must now see what is implied by the equations of motion (10.89). Bearing in mind 
the gauge conditions (10.98), when p, = v — + we find 

{p+fh = 0 — > h = 0. (10.100) 

More explicitly, 

h = rt^h^ = -2 h + ~ + h 11 = 0 — ^ h 11 = 0 , (10.101) 

since /i H = 0 in our gauge. The equation h 11 = 0 means that the matrix h ,J is traceless. 
With h = 0, the equation of motion (10.89) reduces to 

p 2 h ^ - p ll (p a h va ) - p v (p a h' m ) = 0. (10.102) 

Now set (j, = +. We obtain p + (p a h va ) = 0, and as a result 

p a h va = 0. (10.103) 

If (10.103) holds, equation (10.102) reduces to 

7 rV y =0. (10.104) 

This is all that remains of the equation of motion! Before delving into this familiar equa- 
tion, let us investigate the implications of (10.103). The only free index here is v. For 
v = + the equation is trivial, since the h +a are zero in our gauge. Consider now v — I . 
This gives p a h Ia — 0, which we can expand as 

- p+h'- - p~h I+ + PJ h IJ =0 — » h‘- = — p jh IJ . (10.105) 

P + 

Similarly, with v — — we get p a h~ a = 0, which expands to 

— p + h — p~h ^ + p\h~ l = 0 — » li = — -pih~ l . (10.106) 

P + 

Equations (10.105) and (10.106) give us the h with — indices in terms of the transverse 
h I J . There is no additional content to (10.103). 
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We now return to equation (10.104). This equation holds trivially for any field with a + 
index. The equation is nontrivial for transverse indices: 



p 2 h IJ (p) = 0. (10.107) 

Equations p 2 h l ~ = 0 and p 2 li = 0 are automatically satisfied on account of our 
solutions (10.105) and (10.106), together with (10.107). Equation (10.107) implies that 
h IJ (p) = 0 for p 2 ^ 0, in which case all other components of h ll v also vanish. For p 2 — 0, 
the h IJ (p ) are unconstrained, except for the tracelessness condition hu(p) — 0. All other 
components are determined in terms of the transverse components. 

We conclude that the degrees of freedom of the classical Z7 -dimensional gravitational 
field are carried by a symmetric, traceless, transverse tensor field h IJ , the components of 
which satisfy the equation of motion of a massless scalar. This tensor has as many compo- 
nents as a symmetric traceless square matrix of size (D — 2). The number of components 
n(D ) in this matrix is 



n(D) = \{D- 2 )(Z7 - 1) - 1 = X - D(D - 3) . (10.108) 

Moreover, as before, we count a massless scalar as one degree of freedom per point on 
the mass-shell. Therefore we say that a classical gravity wave has n(D) degrees of free- 
dom per point on the mass-shell. In four-dimensional spacetime there are two transverse 
directions, and a symmetric traceless 2x2 matrix has two independent components. In 
four dimensions we thus have n(4) = 2 degrees of freedom. In five dimensions we have 
«(5) = 5 degrees of freedom, in ten dimensions n{ 10) = 35 degrees of freedom, and in 
twenty-six dimensions «( 26) = 299 degrees of freedom. To obtain graviton states, each of 
the independent classical fields h ,J fields is expanded in terms of creation and annihilation 
operators, just as we did for the scalar field in (10.58). To do so we need oscillators ^ 

and a 1J + . We introduce a vacuum I £2), and a basis of states 

P' , pj 1 ' 

(10.109) 

A one-graviton state with momentum ( p + , p j ) is a linear superposition of the above states: 



D - 1 

one-graviton states : Hua 1 ^ |£2) , £// = 0. (10.110) 

7 . 7=2 



Here j is the graviton polarization tensor. The classical tracelessness condition that we 
found earlier becomes in the quantum theory the tracelessness £// = 0 of the polarization 
tensor. Since f/y is a traceless symmetric matrix of size (77 — 2), we have n ( i ) ) linearly 
independent graviton states for each point on the physical mass-shell. 
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Problems 



Problem 10.1 Momentum for the classical scalar field. 



Show that the integral (10.46), when evaluated for the field configuration (10.33) gives 

P — — (a* a — a* a) . 

2E p y > 

Use (10.44) to show that P = p (a*a p — a*_ p a , as quoted in (10.47). 

Problem 10.2 Commutator for the quantum scalar field. 



(a) Consider a periodic function f(x) on the box described above equation (10.34). Such 
a function can be expanded as the Fourier series 

/(*) = I]/(P)* ,>5 - (D 

p 

Show that 

f(p)= ^ f dx' f(x')e-‘P - :x ' . (2) 

Plug (2) back into (1) to obtain an infinite sum representation for the cf-dimensional 
delta function S d (x —x'). 

(b) Consider the complete scalar field expansion in (10.58). Calculate the corresponding 
expansion of n(f, x) — do <p(t, x). Show that 

[</>(t,x),Tl(t,x')] = i8 d (x-x'). (3) 

This is the equal-time commutator between the field operator and its corresponding 
conjugate momentum. Most discussions of quantum field theory begin by postulating 
this commutator. 



Problem 10.3 Light-cone components of Lorentz tensors.^ 

(a) Verify that the Lorentz covariant equation A M = B 11 , for // = 0. 1, . . ., d, implies that 
A+ = B+, A~ = B~, and A 1 = B l . 

Given a Lorentz tensor R l,v , how do we define the light-cone components 
/U , R ++ , . . .? To find out, note that the definition must work for any tensor, so it must 
work when R^ v = A 1 ' B v . Thus, for example, /U = A + B~ , and writing A + and B~ in 
terms of Lorentz components, you can determine R "* in terms of R 00 , R l>l , R l0 , and R l 1 . 

(b) Calculate R ++ , /U , R~ + , and R in terms of the Lorentz components of R ,,v . 
Explain why an equality R llv = S llv between Lorentz tensors implies the equality of 
the light-cone components. 

(c) Check that your result in (b) gives the expected answers for the light-cone components 

of the Minkowski metric: ;? ++ , , rj and ;; . 
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(d) Consider the antisymmetric electromagnetic field strength F lll! in four dimensions. 
Calculate the light-cone components /- H , F +I , F~ ! , and F IJ in terms of the Lorentz 

components of F llv . Rewrite your answers in terms of E and B fields. 

Problem 10.4 Constant electric field in light-cone gauge. 

Find potentials that describe a uniform constant electric field E = Eoe x in the light- 
cone gauge ( A + — (A 0 + A 1 )/-/^ = 0). Write A~ and A 1 in terms of the light-cone 
coordinates x + , x~, and x 1 . 

Problem 10.5 Gravitational fields that are pure gauge. 

Following the discussion of Maxwell fields that are pure gauge, define gravitational fields 
that are pure gauge. Prove that any gravitational field h^Ap) which satisfies the equations 
of motion is pure gauge when p 2 ^ 0. 

Problem 10.6 Field equations and particle states for the Kalb-Ramond field B IJV .' 

Here we examine the field theory of a massless antisymmetric tensor gauge field 
B pv = — B vpL . This gauge field is a tensor analog of the Maxwell gauge field A p . In 
Maxwell theory we defined the field strength F^ v = 9 /; A l; — d v A jJL . For B pv we define 
a field strength H jlvp \ 

Hfivp — d// B vp dvBpp A- dp Bp v . 

(a) Show that H llvp is totally antisymmetric. Prove that H p Vp is invariant under the gauge 
transformations 

SBpv = - d v e p . 

(b) The above gauge transformations are peculiar: the gauge parameters themselves have 
a gauge invariance! Show that e ' given as 

e 'n = ’ 

generates the same gauge transformations as e /t . 

(c) Use light-cone coordinates and momentum space to argue that e + ( p) can be set to zero 
for a suitable choice of X(p). Thus, the effective gauge symmetry of the Kalb-Ramond 
field is generated by the gauge parameters e 1 (p) and e~(p). 

(d) Consider the spacetime action principle 

S~ J d D x(- l -H^ vp H^Py 

Find the field equation for If , ,, , and write it in momentum space. 

(e) What are the suitable light-cone gauge conditions for B l,v l Bearing in mind the 
results of part (c), show that these gauge conditions can be implemented using the 
gauge invariance. Analyze the equations of motion, show that p 1 B lxv = 0, and find 
the components of B I1V which represent truly independent degrees of freedom. 



215 



10.6 Gravitational fields and graviton states 



(f) Argue that the one-particle states of the Kalb-Ramond field are 

D - 1 

E ^v!„ r 

I,J=2 

What kind of matrix is £//? 

Problem 10.7 Massive vector field.’ 

The purpose of this problem is to understand the massive version of Maxwell fields. We 
will see that in D -dimensional spacetime a massive vector field has (!) — 1) degrees of 
freedom. 

Consider the action S = f d D x £ with 

£ = - * - l -m 2 A^ - E M 0 3^0 + mA 11 3 M 0 . 

The first term in £ is the familiar one for the Maxwell field. The second looks like a mass 
term for the Maxwell field, but alone would not suffice. The extra terms show the real 
scalar field 0 that, as we shall see, is eaten to give the gauge field a mass. 

(a) Show that the Lagrangian £ is invariant under the infinitesimal gauge transformation 

8 A^ = , 8(f> = .... 

where the dots denote an expression that you must determine. While the gauge field 
has the familiar Maxwell gauge transformation, it is unusual to have a real scalar field 
with a gauge transformation. 

(b) Vary the action and write down the field equations for A 11 and for 0. 

(c) Argue that the gauge transformations allow us to set 0 = 0. Since the field 0 disappears 
from sight, we say it is eaten. What do the field equations in part (b) simplify into? 

(d) Write the simplified equations in momentum space and show that for p 2 ^ —m 2 there 
are no nontrivial solutions, while for p 2 = —m 2 the solution implies that there are D — 
1 degrees of freedom. (It may be useful to use a Lorentz transformation to represent 
the vector p fl which satisfies p 2 = —m 2 as a vector that has a component only in one 
direction.) 



r 

11 The relativistic quantum point particle 

L 



To prepare ourselves for quantizing the string, we study the light-cone gauge 
quantization of the relativistic point particle. We set up the quantum theory by 
requiring that the Heisenberg operators satisfy the classical equations of motion. 
We show that the quantum states of the relativistic point particle coincide with 
the one-particle states of the quantum scalar field. Moreover, the Schrodinger 
equation for the particle wavefunctions coincides with the classical scalar field 
equations. Finally, we set up light-cone gauge Lorentz generators. 



11.1 Light-cone point particle 



In this section we study the classical relativistic point particle using the light-cone gauge. 
This is, in fact, a much easier task than the one we faced in Chapter 9, where we examined 
the classical relativistic string in the light-cone gauge. Our present discussion will allow 
us to face the complications of quantization in the simpler context of the particle. Many of 
the ideas needed to quantize the string are also needed to quantize the point particle. 

The action for the relativistic point particle was studied in Chapter 5. Let us begin our 
analysis with the expression given in equation (5.15), where an arbitrary parameter r is 
used to parameterize the motion of the particle: 



5 = 



— m 




dx ^ dx v 
dr dr 



dr . 



(1L1) 



In writing the above action, we have set c = 1. We will also set h = 1 when appropriate. 
Finally, the parameter r will be dimensionless, just as it was chosen to be for the relativistic 
string starting in Chapter 9. We can simplify our notation by writing 



dx M dx v 
dr dr 



IJ^vX 1 * x v = x 2 . 



( 11 . 2 ) 



Thinking of r as a time variable and of the x^(r) as coordinates, the action S defines a 
Lagrangian L as 



5 = 



j-Zf 

JXj 



L dr . 



= — Illy/ —X 2 



(11.3) 
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As usual, the momentum is obtained by differentiating the Lagrangian with respect to the 
velocity: 

3 L mx,, 

p u = = . (11.4) 

The Euler-Lagrange equations which arise from the Lagrangian are 



dpi, 

dr 



= 0. 



(11.5) 



All components of the momentum are constants of the motion. Given (11.4) one readily 
checks that the momentum components satisfy the constraint 



p 2 + m 2 = 0 . 



(11.6) 



To define the light-cone gauge for the particle, we set the coordinate x + of the particle 
proportional to r : 




The factor of m 2 on the right-hand side is needed to get the units to work. Now consider 
the + component of equation (1 1.4): 



P = 



V-i 2 V-i 2 m 

Cancelling the common factor of p + , and squaring, we find 

1 



i 2 = - 



,2 ' 



This result helps us simplify the expression (11.4) for the momentum: 

Pu = m 2 . 



( 11 . 8 ) 



(11.9) 



( 11 . 10 ) 



The appearance of m 2 , as opposed to m, is due to our choice of unitless r. The equation of 
motion (1 1.5) then gives 

Xfi = 0. ( 11 . 11 ) 

Expanding the constraint (1 1.6) in light-cone components, 



1 



-2 p + p + p l p 1 + m 2 — 0 — >p — ——(p 1 p 1 + m z ) . (11.12) 

2p+ 

The value of p~ is determined by p + and the components p 1 of the transverse momentum 
pr ■ Having solved for p~, equation (11.10) gives 



dx~ 



1 



= ~2 P 
dr m 



(11.13) 
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which is integrated to find 

x“(r) = xT + ^ x , (11.14) 

m- 

where is a constant of integration. Equation (11.10) also gives dx 1 /dr — p 1 /m 2 , 
which is integrated to give 

x 7 (r) = Xq + ^ x , (11.15) 

m- 

where x q is a constant of integration. Note that the light-cone gauge condition (11.7) 
implies that x + (r) has no constant piece Xq . 

The specification of the motion of the point particle is now complete. Equation (11.12) 
tells us that the momentum is completely determined once we fix p + and the components 
p 1 of the transverse momentum pj . The motion in the x~ direction is determined by 
(1 1.14), once we fix the value of x^. The transverse motion is determined by the x 7 (r), or 
the Xq , since we presume to know the p 1 . For a symmetric treatment of coordinates versus 
momenta in the quantum theory, we choose the x 1 as dynamical variables. Our independent 
dynamical variables for the point particle are therefore 

Dynamical variables: ^x 7 , x// , p l , p + ^j . (11.16) 

11.2 Heisenberg and Schrodinger pictures 



Traditionally, there are two main approaches to the understanding of time evolution in 
quantum mechanics. In the Schrodinger picture, the state of a system evolves in time, 
while operators remain unchanged. In the Heisenberg picture, it is the operators which 
evolve in time, while the state remains unchanged. Of the two, the Heisenberg picture is 
more closely related to classical mechanics, where the dynamical variables (which become 
operators in quantum mechanics) evolve in time. Both the Schrodinger and the Heisenberg 
pictures will be useful in developing the quantum theories of the relativistic point particle 
and the relativistic string. Because we would like to exploit our understanding of classical 
dynamics in developing the quantum theories, we will begin by focusing on the Heisenberg 
picture. 

Both the Heisenberg and the Schrodinger picture make use of the same state space. 
Whereas in the Heisenberg picture the state which represents a particular physical sys- 
tem is fixed in time, in the Schrodinger picture the state of a system is constantly changing 
direction in the state space in a manner which is determined by the Schrodinger equation. 
Although we generally think of the operators in the Schrodinger picture as being time inde- 
pendent, there are those which depend explicitly on time and therefore have time depen- 
dence. These operators are formed from time-independent operators and the variable t. 
For example, the position and momentum operators q and p are time independent. But 
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the operator O = q + p t has explicit time dependence. If it has explicit time dependence, 
even the Hamiltonian H(p, q; t ) can be a time-dependent operator. 

Now, as we move from the Schrodinger to the Heisenberg picture, we will encounter oper- 
ators with two types of time dependence. As we noted earlier, Heisenberg operators have 
time dependence, but this time dependence can be both implicit and explicit. The Heisen- 
berg equivalent of a time-independent Schrodinger operator is said to have implicit time 
dependence. This implicit time dependence is due to our folding into the operators the 
time dependence which, in the Schrodinger picture, is present in the state. If a Heisenberg 
operator is explicitly time dependent it is because the explicit time dependence of the cor- 
responding Schrodinger operator has been carried over. 



When we pass from the Schrodinger to the Heisenberg picture, the time-independent 
Schrodinger operators q and p, for example, become q(t) and pit), respectively. The 
Schrodinger commutator [q , p] = i turns into the commutator 



[q(t), p(t)] = i . 



(11.17) 



Although q{t) and pit) depend on time, their time dependence is implicit. If £(f) is 
a Heisenberg operator arising from a time-independent Schrodinger operator, the time 
evolution of § (r) is governed by 

= [$(0, H(p(t),q(ty,t)]. (11.18) 



Here H(p(t), q(t); t ) is the Heisenberg Hamiltonian corresponding to the possibly time- 
dependent Schrodinger Hamiltonian Hip, q; t ). 

If Oit) is the Heisenberg operator which corresponds to an explicitly time-dependent 
Schrodinger operator, then the time evolution of Oit) is given by 



dOit) dO 

i = i 

dt 3 1 



\Oit), Hipit), qit); t) . 



(11.19) 



This equation reduces to (1 1.18) when the operator has no explicit time dependence. If the 
Hamiltonian H{pit), qit)) has no explicit time dependence, then we can use (11.18) with 
£ = H, to find 

Hipit), qit)) = 0. (11.20) 

dt 

In this case the Hamiltonian is a constant of the motion. 



The discussion above must be supplemented by rules to pass from Schrodinger to Heisen- 
berg operators. Assume that the Schrodinger Hamiltonian Hip, q) is time independent. In 
this case a state I'T) at time t — 0 evolves in time, and is given at time t by 

I'M) =e~ iHt \'l '). 



( 11 . 21 ) 
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Quick calculation 11.1 Confirm that |VP, t) satisfies the Schrodinger equation 




dt 



(11.22) 



It is clear from (11.21) that the operator e lHt brings time-dependent states to rest: 



If we act with this operator on the product ojfi/, t), where a is a Schrodinger operator, we 
find 



where a(t ) = e ,Ht a e~ ,Ht is the Heisenberg operator corresponding to the Schrodinger 
operator a. This definition also applies for a Schrodinger operator as(t) that has explicit 
time dependence, in which case the corresponding Heisenberg operator is = 

e lH, as The construction ensures that given a set of Schrodinger operators that 

satisfy certain commutation relations, the corresponding Heisenberg operators satisfy the 
same commutation relations. 

Quick calculation 11.2 If [ai, 012 ] = 0:3 holds for Schrodinger operators ai, aj, and 0 : 3 , 
show that [aq(r), « 2 (t)] = U 3 (t) holds for the corresponding Heisenberg operators. 

This result holds even if the Hamiltonian is time dependent (Problem 11.2). It justifies the 
commutator in (1 1.17), noting that the constant right-hand side is not affected by the rule 
turning a Schrodinger operator into a Heisenberg operator. 



We now develop a quantum theory from the classical theory of the relativistic point particle. 
We will define the relevant Schrodinger and Heisenberg operators, including the Hamilto- 
nian, and describe the state space. All of this will be done in the light-cone gauge. 

Our first step is to choose a set of time-independent Schrodinger operators. A reasonable 
choice is provided by the dynamical variables in (1 1.16): 



time-independent Schrodinger operators : ( x 1 , x () , p ! , P + ) ■ (11.25) 



e iH, \ V,t) = |vp) . 



(11.23) 



e iHt a\^,t) = e iHt ae~ iH, \^) = a(f)|'P> , 



(11.24) 



11.3 Quantization of the point particle 



We could include hats to distinguish the operators from their eigenvalues, but this will not 
be necessary in most cases. We parameterize the trajectory of a point particle using r, so 
the associated Heisenberg operators are: 
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Heisenberg operators : ^x / (r), x Q (r) , p 1 (x) , p + ( r)^ . (11.26) 

We postulate the following commutation relations for the Schrodinger operators: 

[x 1 , p J ] = i r] IJ , [xq , p + ] = i ri~ + = —i , (11.27) 



with all other commutators set equal to zero. The first commutator is the familiar commuta- 
tor of spatial coordinates with the corresponding spatial momenta (recall that r\ I] = 8 IJ ). 
The second commutator is well motivated, after all, x^ is treated as a spatial coordinate 
in the light-cone, and p + is the corresponding conjugate momentum. The second com- 
mutator, just like the first one, has an i] carrying the indices of the coordinate and the 
momentum. 

The Heisenberg operators, as explained earlier, satisfy the same commutation relations 
as the Schrodinger operators: 

[x , (r),p J (T)] = ir l IJ , [ X q (r), — —i , (11.28) 

with all other commutators set equal to zero. 

We have discussed the operators that correspond to the independent observables of the clas- 
sical theory. But just as there are classical observables which depend on those independent 
ones, there are also quantum operators which are constructed from the set of independent 
Schrodinger operators, and time. These additional operators are x + (r),x _ (t) and p~ . 
These operators are defined using the quantum analogs of equations (11.7), (11.14), and 
(11.12): 

, p + 

x+(r) = (11.29) 

m z 

X~(t) = X n “+^rT, (11.30) 

m z 

p~ = P 1 + m2 ) ■ (11.31) 

Note that p~ is time independent. Both x + (r) and x _ (r) are time-dependent Schrodinger 
operators. 

The commutation relations involving the operators x + (r),x _ (r), and p~ are deter- 
mined by the postulated commutation relations in (11.27), along with the defining 
equations (1 1.29) — (1 1.31). The decision to choose the operators in (1 1.25) as the indepen- 
dent operators of our quantum theory was very significant. For example, if we had chosen 
x+ and p~ to be independent operators, we might have been led to write a commutation 
relation [x + , /?“] = —i . In our present framework, however, this quantity vanishes, since 

lp + ,p'] = o. 
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We have not yet determined the Hamiltonian H . Since p is the light-cone energy (see 
(2.94)), we expect it to generate x + evolution: 



9 

dx + 



(11.32) 



Although x + is light-cone time, we are parameterizing our operators with r, so we expect 
H to generate r evolution, which is related, but is not the same as x + evolution. Since 
x + = p + x/m 2 , we can anticipate that r evolution will be generated by 



9 p + 9 p + 

9r m 2 9x+ m 2 ^ 



(11.33) 



We therefore postulate the Heisenberg Hamiltonian 




Note that H{ r) has no explicit time dependence. Equation (1 1.20) applies, and as a result, 
the Hamiltonian is actually time independent. 



Let us now make sure that this Hamiltonian generates the expected equations of motion. 
First we check that H gives the correct time evolution of the Heisenberg operators (1 1.26) 
which arise from the time-independent Schrodinger operators. The equation governing the 
time evolution of those operators is (11.18). Let us begin with p + and p 1 : 

i^^- = [p+(x),H( r)] = 0, 

i ^^- = [p I (T), H (T)]=0. (11.35) 



Both of these commutators vanish because H is a function of p 1 (r) alone, and all the 
momenta commute. Equations (11.35) are good news, because the classical momenta p + 
and p 1 are constants of the motion. This allows us to write p 1 (t) = p 1 and p + (r) = p + . 
We now test the r development of the Heisenberg operator x 1 (r): 



dx 1 { r) 

i 

dr 



[*'«. W(pV + „,2) 




m 2 



(11.36) 



Here, we have used [x 1 , p J p J ] = [x 1 , p J ]p J + p J [x ! , p J ] = 2i p 1 . Cancelling the 
common factor of i in (1 1.36), we find 



dx 1 (x) p 1 

dx m 2 



(11.37) 



This result is in accord with our classical expectations and allows us to write 



= *o + ^7 T - 



(11.38) 
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where xk is an operator without any time dependence. Finally, we must examine x Q (r). 
Since x^"(r) commutes with p 1 (r), 



. dx 0 (r) 
dr 




h( p, ‘ 



2 m 2 




= 0 . 



(11.39) 



As expected, this operator is a constant of the motion, and we can write Xq(t) — Xq . So 
as far as the operators in (11.25) are concerned, our ansatz for H functions properly as a 
Hamiltonian. 



We now turn to the remaining operators x + (r), x _ (r), and p~( r). Of these, p~( r) is 
a function of the p 1 only and is therefore time independent. It is easy to to see that the 
commutator with H vanishes, so we have nothing left to check for this operator. The 
Heisenberg operators x + (r) and x _ (r) both arise from Schrodinger operators with explicit 
time dependence, so we use (11.19) to calculate their time evolution. For example: 

dx~( r) dx~ r -i 

i— - = i— + [x~(T),H(T)]. (11.40) 

dr dr 

Since x _ (r) = x^ + p~r/m 2 and both x ^ and p~ commute with the p 1 , we see that 
[x _ (r), H{r )] = 0. Consequently, 



dx (r) p 
dr m~ 



(11.41) 



which is the expected result. Similarly, since x + (r) = p + r/m 2 , we find that 
[x + (r), H (r)] = 0, and therefore 



dx + (r) 

dr 



dx + 

dr 



(11.42) 



These computations show that our ansatz (11.34) for the Hamiltonian generates the 
expected equations of operator evolution. 



Quick calculation 11.3 We introduced x^ in (11.38) as a constant operator. Show that 
dx{ } /d r must be calculated by viewing x ( - as the explicitly time-dependent Heisenberg 
operator defined by (1 1.38). 



To complete our construction of the point particle quantum theory we must develop 
the state space, set up the Schrodinger equation, and define physical states. The time- 
independent states of the quantum theory are labeled by the eigenvalues of a maximal set 
of commuting operators. For the set of operators introduced in (11.25), a maximal com- 
muting subset can include only one element from the pair ( x ~ , p + ), and one element from 
each of the pairs (x 1 , p 1 ). Because it is usually convenient to work in momentum space, 
we will work with the operators p + and p 1 . So we write the states as 



states of the quantum point particle: | p + , pr) ■ 



(11.43) 
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where p + is the eigenvalue of the p + operator, and pj is the transverse momentum, the 
components of which are the eigenvalues of the p 1 operators: 

P + \P + , Pt) = P + \P + , Pt) > P 1 \P + , Pt) = P 1 \P + , Pt) ■ (11.44) 

In light of (11.31), these equations imply 

P~\P + , Pt) = ^~+{p' p' + m2 )\ P + > Pt) ■ (11.45) 

In addition, the Hamiltonian (1 1.34) acting on the states gives 

H \ p+ , pr) = 2^2 (/>V +m 2 )\p+, pr). (11.46) 

It then follows that the time-dependent states 

ex p(-i 2 ~2( pI pI + m2 ) r ) I P + ’ Pt) (11.47) 

satisfy the Schrodinger equation. They are the physical time-dependent states associated 
with the states in (1 1.43). 

More generally, we consider time-dependent superpositions of the basis states in ( 1 1 .43): 

I'b, t) = J dp + dp T VKu P + ,Pt) \p + , Pt) ■ (11.48) 

Since p + and pj are continuous variables, an integral is necessary. To produce a general r- 
dependent superposition, we introduced the arbitrary function i/r(r, p + , pr)- In fact, this 
function is the momentum-space wavefunction associated with the state |vk, r). Indeed, 
with dual bras ( p + , pj\ defined to satisfy 





(p' + > Pt\ P + ’ Pt) = s (p' + ~ P + )S(Pt - pt) , 


(11.49) 


we see that 








( P + , Pt\W, t) = f(r, p + , p T ) ■ 


(11.50) 


The Schrodinger equation for the state |'I», r) is 






i-H-\*,*) = H\V, r>. 
3r 


(11.51) 



Using the state in (1 1.48) and the Hamiltonian in (1 1.34), we find 

Jdp + dp T [/^r/r(r, p + , p T ) - ^ (V p 1 +m 2 )r/r( r, p + , /5r)]lf> + , Pt) = 0. 

(11.52) 

Since the basis vectors | p + ,pr) are all linearly independent, the expression within 
brackets must vanish for all values of the momenta: 

P+ ’ PT ^ = 2m^( P ‘ pI +, ” 2 )^^ t ’ P + ’ PT )• (11.53) 

We recognize this equation as a Schrodinger equation for the momentum-space wavefunc- 
tion t/c (r, p + , pj)- If the wavefunction satisfies the Schrodinger equation, the state T, r) 
is a physical time-dependent state. This completes our development of the point particle 
quantum theory. 
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11.4 Quantum particle and scalar particles 



The states of the quantum point particle given in ( 1 1 .43) may remind you of the one-particle 
states (10.66) in the quantum theory of the scalar field. This is actually a fundamental 
correspondence. 

There is a natural identification of the quantum states of a relativistic point 
particle of mass m with the one-particle states of the quantum theory of a 
scalar field of mass m: 

\P + , Pt) ► aJ + , pr |S2). (11-54) 



The identification is possible because the labels of the point particle states match with the 
labels of the creation operators which generate the one-particle states of the scalar quan- 
tum field theory. The correspondence between the quantum point particle and the quantum 
scalar field theory can be extended from the state space to the operators that act on the state 
space. The quantum point particle theory has operators p + , p 1 , and p~, and so does the 
quantum field theory, as shown in (10.67). If we identify the state spaces using (11.54), 
then the two sets of operators give the same eigenvalues. This makes the identification 
natural. 

The above observations lead us to conclude that the states of the quantum point particle and 
the one-particle states of the scalar field theory are indistinguishable. Because it contains 
creation operators that can act multiple times on the vacuum state, the scalar field theory 
has multiparticle states that did not arise in our quantization of the point particle. Indeed, 
there are no creation operators in the theory of the quantum point particle. Because it pro- 
vides a natural description of multiparticle states, the scalar field theory can be said to be a 
more complete theory. 

How could we have anticipated that the one-particle states of a quantum scalar field theory 
would match those of the quantum point particle? The answer is quite interesting: the 
Schrodinger equation for the quantum point particle wavefunctions has the form of the 
classical field equation for the scalar field. More precisely: 

There is a canonical correspondence between the quantum point particle wave- 
functions and the classical scalar field, such that the Schrodinger equation for 
the quantum point particle wavefunctions becomes the classical field equation 
for the scalar field. 

One element of this correspondence is the classical field equation for the scalar field. In 
light-cone gauge, this equation takes the form (10.31): 

(i 7^ - ^ (pV +'» 2 ))</>0h p + , pr) = 0. 



(11.55) 
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This differential equation is first order in r. The other element in the correspondence is 
this Schrodinger equation (11.53). The two equations are identical once we identify the 
wavefunction r/r (r, p + , pj ) and the scalar field <p{x , p + , pr)' 



This is the claimed correspondence. 

The quantization of the point particle is an example of first quantization. In first quan- 
tization, the coordinates and momenta of classical mechanics are turned into quantum 
operators and a state space is constructed. Generically, the result is a set of one-particle 
states. Second quantization refers to the quantization of a classical field theory, the result 
of which is a quantum field theory with field operators and multiparticle states. Our analysis 
allows us to see how second quantization follows after first quantization. A first quantiza- 
tion of the classical point particle mechanics gives one-particle states. We then reinterpret 
the Schrodinger equation for the associated wavefunctions as the classical field equation 
for a scalar field. A second quantization, this time of the classical field theory, gives us the 
set of multiparticle states. 

So far we have only quantized the free relativistic point particle. All quantum states, includ- 
ing the multiparticle ones obtained by second quantization, represent free particles. How 
do we get interactions between the particles? Such processes are included in the scalar field 
theory by adding interaction terms to the action. All the terms that we have included so far 
are quadratic in the fields. The interaction terms include three or more fields. Since the 
quantum point particle state space does not include multiparticle states, the description of 
interactions in the language of first quantization is not straightforward. On the other hand, 
in the framework of quantum field theory interactions are dealt with very naturally. 



Since the point particle Lagrangian L in (11.3) depends only on r derivatives of the 
coordinates, it is invariant under the translations 



with c 1 ' constant. The conserved charge associated with this symmetry transformation is 
the momentum p jJL of the particle. This follows from (8.16) and ( 1 1.4). 

What happens to conserved charges in the quantum theory? They become quantum opera- 
tors with a remarkable property: they generate, via commutation, a quantum version of the 
symmetry transformation that gave rise to them classically! 

This property is most apparent if we use a framework where the manifest Lorentz invari- 
ance of the classical theory is preserved in the quantization. This is not the framework we 




(11.56) 



11.5 Light-cone momentum operators 



<5x M (r) = , 



(11.57) 
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have used to quantize the point particle. In light-cone gauge quantization, the x° and x 1 
coordinates of the particle are afforded special treatment. This hides the Lorentz invariance 
of the theory from plain view. We will not discuss fully the Lorentz covariant quantization 
of the point particle. A few remarks will suffice for our present purposes. The covariant 
quantization of the string is discussed in some detail in Chapter 24. 

In the Lorentz covariant quantization of the point particle, we have Heisenberg operators 
x p {r) and //'(T). Note that even the time coordinate x°(r) becomes an operator! The 
commutation relations are 

[x^(r ),p v {x)i = iif v , (11.58) 

as well as 

[x M (r), x y (r)] = 0 and [ ^(r), p v {x)] = 0 . (11.59) 

Equation ( 1 1 .58) is reasonable. The indices match, which ensures consistency with Lorentz 
covariance. Moreover, when // and v take spatial values, the commutation relations are the 
familiar ones. We already know that (11.58) is not consistent with the light-cone gauge 
commutators of Section 11.3. We saw there that [x + (r), p~( r)] = 0, while (1 1.58) would 
predict a nonzero result. An equality of two objects which carry Lorentz indices applies 
when the indices run over the light-cone values +, — , and I . The equation R pv = S llv , 
gives, for example, R ~ 1 = ,S H (Problem 10.3). As a result, equation (11.58) indeed 
gives [x + (t), p~(r)] = ir z -1 = — i. Let us now check that the operator p p (r) generates 

translations. More precisely, we check that ie p p p { r) generates the translation (1 1.57): 

8x p (t) = [ie p p p ( r) ,x M (r) ] = ie p (-iii pp ) = . (11.60) 

This is an elegant result, but it is by no means clear that it carries over to our light-cone 
gauge quantization. We must find out whether the light-cone gauge momentum operators 
generate translations. 

For this purpose, we expand the generator ie p p p (t) in light-cone components: 

ie p p p ( r) = — ie~ p + — ie + p~ + ie r p 1 . (1 1.61) 

We have dropped the r arguments from the momenta because they are r-independent. Note 
that here, p~ is given by (1 1.31). Let us test (1 1.60) with e 1 ^ 0, and e + — e~ = 0: 

8x p (t) = ie*[ p 1 , x^(r) ] . (11.62) 

We would expect that 8x J ( r) = e J and that 8x + (r) = 8x~(r) = 0. All these expectations 
are realized. Choosing p = J, and using the commutator (1 1.28) we find Sx J (r) = e J . To 
compute the action on x + (r) and x“(r), we must use their definitions: 

x + (r) = ^ T r, x“(r)=x ( r + ^ ? r. (11.63) 

m z m z 

Recalling that p 1 commutes with all momenta and with x ( ~ , we confirm that i5x + (r) = 
Sx~( r) = 0. 

Quick calculation 11.4 Test (11.60) with e~ ^ 0 and e + = e 1 = 0. To do this compute 
8x p (t) = —ie~[p + ,x M (r)]. Conhrm that Sx~( r) = e~ and that all other coordinates 
are not changed. 



228 



The relativistic quantum point particle 



It remains to see whether p~ generates the expected translations. Since p~ is a nontrivial 
function of other momenta, there is some scope for complications! This time we consider 
the transformations that are generated using (11.60) with e + ^ 0 and e~ — e 1 — 0: 

&x tJ '(x) = -ie + [p~ ,^(r)]. (11.64) 



The naive expectation c5x + (r) = e + is not realized: choosing p, = + and using (1 1.63) we 
see that 



Sx 



+(x) = -ie+\ p~ , p + ^] =0. 
L m z J 



(11.65) 



Not only is x + (r ) left unchanged, but the other components, which naively should be left 
unchanged, are not: 



Sx 1 (r) = 
Sx~( x) — “ 



■,x 7 (r)j = (-2 ip 1 ) = 



-_,+ P 



m- 



p ,x Q + —x 



,+ P 



= ~ie [p ,x 0 ] = — . 

p+ 



(11.66) 

(11.67) 



In these calculations only one step requires some explanation. How do we find [ p ~ , x { 7 ] ? 
The only reason p~ does not commute with x^ is that p~ depends on p + . In fact, what 
we need to know is the commutator , 1 / p + \. This can be found as follows: 



r- ! i - 1 1 - 1 + - 1 1 - + 1 



(11.68) 



Quick calculation 11.5 Use (1 1.68) to show that 



[*o - 




(11.69) 



Equations (11.65), (11.66), and (1 1.67) show that p~ does not generate the expected trans- 
formations. What happened? It turns out that p~ actually generates both a translation and 
a reparameterization of the world-line of the particle. We know that the particle action 
is invariant under changes of parameterization r — >- r'(r). When we described symme- 
tries in Chapter 8, however, we exhibited them as changes in the dynamical variables 
of the system. A change in parameterization can also be described in that way. Writing 
r -* r' = r + A(r), with /, infinitesimal, we note that the plausible change 

x^(x) — » x^{x + X{x))=x> J -{x) + X{x)d z x^{x), (11.70) 

leads us to write 

Sx ll (x) = X(x)d T x tl ( x). (11.71) 

We claim that these are symmetries of the point particle theory. Actually, the variation 
(11.71) does not leave the point particle Lagrangian invariant. The Lagrangian changes 
into a total r derivative (Problem 1 1.4), but this, in fact, suffices to have a symmetry (see 
Problem 8.9). 
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Let us now show that p~ generates a translation plus a reparameterization. The expected 
translation was <5.r + = e + . On the other hand, from (11.71), a reparameterization of 
x+ gives Sx + — Xd T x + . Bearing in mind (11.65), the expected translation plus the 
reparameterization give zero variation, so, 

n+ 

0 = e+ + L3 T x + (r) = e + + A.^- — » 1 = -e + . (11.72) 

m z p + 

The reparameterization parameter X turns out to be a constant. We can now use this result to 
“explain” the transformations (1 1.66) and (1 1.67) that p~ generates on x 1 and on x~. For 
these coordinates there is no translation, but the reparameterization still applies. Therefore, 

2 1 I 

r i 777 , D i p 

Sx 1 (r) = Xdrx’ix) = -e + = -e + ^~ , (11.73) 

/?+ m z p+ 

m 2 , p~ , p~ 

Sx~(r) = Xd T x~(r) = e + = -e + ^~ , (11.74) 

p + m l p + 

in perfect agreement with the transformations generated by p~ . We can also understand 
why p~ does not change x + . If x + had been changed by a constant c + , the new x + coor- 
dinate would not satisfy the light-cone gauge condition whereby x + is just proportional 
to r. In fact, p~ generates a translation plus the compensating transformation needed to 
preserve the light-cone gauge condition! That transformation turned out to be a reparame- 
terization of the world-line. 

One final remark about momentum operators. The Lorentz covariant momentum oper- 
ators that we used to motivate our analysis generate simple translations and commute 
among each other. It follows directly that, using light-cone coordinates, the operators 
p ± — (p° ± /;' )/'j2 and the transverse p 1 all commute. The light-cone gauge momentum 
operators we discussed above are completely different objects. They had an intricate action 
on coordinates, and p~ was defined in terms of the transverse momenta and p + . Never- 
theless, all the light-cone gauge momentum operators still commute. They obey the same 
commutation relations that the covariant operators do when expressed using light-cone 
coordinates. 



11.6 Light-cone Lorentz generators 



In Section 8.5 we determined the conserved charges that are associated with the Lorentz 
invariance of the relativistic string Lagrangian. Similar charges exist for the relativistic 
point particle. As we found in (8.52), the infinitesimal Lorentz transformations of the point 
particle coordinates x , 1 (t) take the form 



Sx^ir) = e^ v x v (r) , 



(11.75) 
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where e pv = —e vp are a set of infinitesimal constants. The associated Lorentz charges are 
given by 

M‘ uv = x»{t)p v (t) - x v (t ) p p (r ) , (11.76) 

as you may have derived in Problem 8.5. These charges are conserved classically. The 
quantum charges are expected to generate Lorentz transformations of the coordinates. 
Again, it is straightforward to see this using the operators of Lorentz covariant quanti- 
zation. In this case, the quantum charges are given by (1 1.76) with x p (r ) and p p (r) taken 
to be the Heisenberg operators introduced earlier and satisfying the commutation relations 
(1 1.58) and (1 1.59). Both x ll (r) and p p (r) are Hermitian operators. The Lorentz charges 
M IIV are Hermitian as well: 

(M /^)t = p v ( T ) x ^(r) - p H( T )x u (r) = M^ v , (11.77) 

since the two constants induced by rearranging the coordinates and momenta back to the 
original form cancel out. 

Quick calculation 11.6 Show that 

[ M flv , x p (r) ] =ir j flp x v (T)-ir, vp x fl (T). (11.78) 

This commutator helps us check that the quantum Lorentz charges generate Lorentz 
transformations: 

Sx p (z)=[- l -e lxv M pv ,x p ( r)] 

= ^^(^^(r)- p vp x p { r)) 

= ^e pv x v (r) + ^e pp Xll (r) = e pv x v (r) ■ ( 11 . 79 ) 

Equation (1 1.78) can be used in light-cone coordinates by simply using light-cone indices. 
For example, 

[M _/ ,x+( r)] = i p~ + x 1 (r) — i p I+ x~ (r) = -i x 1 ( r) , (11.80) 

since rj I+ — 0. The operator M~ I here is a Lorentz covariant generator expressed in light- 
cone coordinates. It is not a light-cone gauge Lorentz generator. Those we have not yet 
constructed. 

Given a set of quantum operators, it is interesting to calculate their commutators. In quan- 
tum mechanics, for example, you learned that the components L x , L y , and L z of the 
angular momentum satisfy a set of commutation relations ([ L x , L v ] = i L z , and others) 
that define the Lie algebra of angular momentum. The momentum operators p p consid- 
ered earlier define a very simple Lie algebra: they all commute. We would like to know 
what is the commutator of two Lorentz generators. The computation takes a few steps 
(Problem 1 1.5). Using equation (1 1.78), and a similar equation for [M pv , p p \, one finds 
that the commutator can be written as a linear combination of four Lorentz generators: 



[M p \ M pa ] = irj pp M va - ip vp M pa + M pv - ip va M pp . 



(11.81) 
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This result defines the Lorentz Lie algebra. Equation (11.81) must be satisfied by the 
analogous operators M^ v of any Lorentz invariant quantum theory. If it is not possible 
to construct such operators, the theory is not Lorentz invariant. This will be crucial to our 
quantization of the string, for requiring that (11.81) holds imposes additional restrictions, 
which have significant physical consequences. 

Quick calculation 11.7 Since M I1V = — M u/ L the left-hand side of (11.81) changes sign 
under the exchange of fi and v. Verify that the right-hand side also changes sign under this 
exchange. 

We can now use (11.81) to determine the commutators of Lorentz charges in light-cone 
coordinates. The Lorentz generators are given by 

M 1J , M +I , M~ l , and M + ~ . (11.82) 

Consider, for example, the commutator [M -1 , M +I ], To use (1 1.81) notice the structure 

of its right-hand side: each 77 contains one index from each of the generators in the left- 
hand side. For [M H , M +I ], the only way to get a nonvanishing i) is to use the — from the 
first generator and the + from the second generator. The nonvanishing term is the second 
one on the right-hand side of (1 1.81), and we find 

[M + ~,M +I ] = -ir 1 ~+M +I = iM +I . (11.83) 

Similarly, 

[M -/ , M~ J ] = 0 . (11.84) 

Here 77 must use the I and J indices, but then the other two indices must go into M giving 
us M , which vanishes by antisymmetry. 

Quick calculation 11.8 Show that M + ~ = M 10 . This shows that M + ~ generates a boost 
along the x l direction. 

So far, we have considered the covariant Lorentz charges in light-cone coordinates. We 
must now find Lorentz charges for our light-cone gauge quantization of the particle. Our 
earlier discussion of the momenta suggests that we really face three questions. 

(1) How are these charges going to be defined? 

(2) What kind of transformations will they generate? 

(3) Which commutation relations will they satisfy? 

In the remaining part of this section we will explore question (1) in detail. Before doing 
so, let us give brief answers to questions (2) and (3), leaving further analysis of these ques- 
tions to Problems 1 1.6 and 1 1.7. The light-cone gauge Lorentz generators are expected to 
generate Lorentz transformations of coordinates and momentum, but in some cases, these 
transformations will be accompanied by reparameterizations of the world-line. Regarding 
(3), the light-cone gauge Lorentz generators will satisfy the same commutation relations 
that the covariant operators in light-cone coordinates do. This establishes that Lorentz sym- 
metry holds in the light-cone theory of the quantum point particle. The success of the 
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construction is not obvious a priori. It is not clear that the reduced set of light-cone gauge 
operators suffices to construct quantum Lorentz charges that generate Lorentz transforma- 
tions (plus other transformations) and satisfy the Lorentz algebra. 

The simplest guess for the light-cone gauge generators is to use light-cone coordinates in 
the covariant formula (1 1.76) and then replace x + (r), x~(r), and p~ using their light-cone 
gauge definitions in (1 1.29), (1 1.30), and (1 1.31). Let us try this prescription with M" 1 : 

M + ~ = x + (r) p~(r) — x~(r) p + ( r) 

? P +r - ( - , P~ \ + 

= y P ~ 1*0 + — r )P 

= -x~p + . (11.85) 

Since xL and /; + are r -independent, so too is M" 1 . We have a minor complication, how- 
ever. The operator M" 1 is not Hermitian: (M H ) ' — AT 1 = [x^~ , p + ] ^ 0. This failure 
of Hermiticity illustrates how the use of the light-cone gauge can affect basic properties of 
operators. The covariant Lorentz generators were automatically Hermitian, the light-cone 
gauge generators are not. We are therefore motivated to define a Hermitian M" 1 as 

M + ~ = -}^{XqP + + p + x~) . (11.86) 

We take this to be the light-cone gauge Lorentz generator M" 1 . 

The most complicated of all generators is M -/ . It is the most interesting one as well. 
The prescription used for M" 1 this time gives 

M -/ = x~{x) p 1 — x 1 (r) p~ 

= XqP : -XqP~. (11.87) 

As before, the r-dependence vanishes, but we are left with a complicated result since p~ 
is a nontrivial function of the other momenta. We define M~ ! as the Hermitian version of 
the operator obtained above: 

M ~‘ = x 0 P 1 - \ ( x o p~ + p~ x o) ■ (11.88) 

If the light-cone gauge Lorentz charges are to satisfy the Lorentz algebra we must have 

[M _/ , M“ 7 ] = 0, (11.89) 

as we noted in (1 1.84). Does M -/ , as defined by (1 1.88), satisfy this equation? The answer 
is yes, as you will see for yourself in Problem 11.6. This result is necessary to ensure 
Lorentz invariance of the quantum theory. All other commutators of Lorentz generators 
also work out correctly. 

The calculation of [ M -/ , M~ J ] in quantum string theory is fairly complicated, but the 
answer is very interesting. It turns out that this commutator is zero if and only if the string 
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propagates in a spacetime of some particular dimension and, furthermore, only if the defini- 
tion of mass is changed in such a way that we can find massless gauge fields in the spectrum 
of the open string! String theory is such a constrained theory that it is only Lorentz invariant 
for a fixed spacetime dimensionality. 



Problems 



Problem 11.1 Equation of motion for Heisenberg operators. 

Assume that the Schrodinger Hamiltonian // = H (p, q) is time independent. In this 
case the time-independent Schrodinger operator £ yields a Heisenberg operator q (t) = 
e lHr ^e~ lHr . Show that this operator satisfies the equation 

d$(t) r 1 

= L* (0, ■ 

This computation proves that equation (11.18) holds for time-independent Hamiltonians. 

Problem 11.2 Heisenberg operators and time-dependent Hamiltonians. 

When the Schrodinger Hamiltonian H = Hip. q\ t) is time dependent, time evolution of 
states is generated by a unitary operator U ( t ): 

I*, f) = £/(*)!*> , (l) 

where U (?) bears some nontrivial relation to H . Here |VP) denotes the state at zero time 
and U (0) = 1. 



(a) Use the Schrodinger equation to show that 



dU(t) 

i 

dt 



= HU{t). 



( 2 ) 



Let U = U{t), for brevity. Since U~ l acting on T. t) gives a time-independent state, 
considerations similar to those given for ( 1 1 .24) lead us to define the Heisenberg operator 
corresponding to the Schrodinger operator a as 

u(t) = U~ l aU. (3) 



(b) Let £ be a time-independent Schrodinger operator, and let £(f) be the corresponding 
Heisenberg operator, defined using (3). Show that 

d£(t) r 1 

i = [Ht), H(p(t),q(ty,t) ■ 

This computation proves that equation (11.18) holds for time-dependent Hamiltonians. 

(c) If [a | , c ^2 ] = «3 holds for Schrodinger operators ot\ .ot 2 , and aq, show that 
[a i (r), oi 2 (01 = «3 (f) holds for the corresponding Heisenberg operators. 
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Problem 11.3 Classical dynamics in Hamiltonian language. 



Consider a classical phase space (q, p), a trajectory (q(t), p(t)), and an observable 
v ( q{t ), pity, t). From the standard rules of differentiation. 



dv dv dv dp dv dq 

dt d 1 dp dt dq dt 



( 1 ) 



With the Poisson bracket defined as 



dA d B d A d B 

dq dp dp dq 



(2) 



show that 



dv dv 

— = \- [v, H] . 

dt dt 



( 3 ) 



Comparing this result to (1 1.19) we see the parallel between the time evolution of a general 
operator O and the classical Hamiltonian evolution of an observable v in phase space. 



To derive (3) you need the classical equations of motion in Hamiltonian language. These 
can be obtained by demanding that 

J dt{p{t)q(t) - H(p(t),q(t);t)J 

be stationary for independent variations Sq(t) and Sp(t). 

Problem 1 1.4 Reparameterization symmetries of the point particle. 

Show that the variation 8 x p (r) — ).(T)d z x p (r) induces a variation 8L of the point particle 
Lagrangian that can be written as 

8L(r) = 3 T (k(r)L(r)) . 

This proves that the reparameterizations 8x p are symmetries in the sense defined in 
Problem 8.9. Show, however, that the charges associated with these reparameterization 
symmetries vanish. When X is t - independent, the reparameterization is an infinitesimal 
constant r translation. The conserved charge is then the Hamiltonian. Show directly that 
the Hamiltonian defined canonically from the point particle Lagrangian vanishes. 

Problem 11.5 Lorentz generators and Lorentz algebra. 

In this problem we consider the Lorentz covariant charges (1 1.76). 

(a) Calculate the commutator [ M pv , p p ], 

(b) Calculate the commutator [ M pv , M pa ] and verify that (11.81) holds. 

(c) Consider the Lorentz algebra in light cone coordinates. Give 

[M ±! , M jk ], [M ±i ,M tJ ], [M + ~ , M ±7 ] , and [M ±r , M ±J ], 

Problem 11.6 Light-cone gauge commutator [M -/ , M~ J ] for the particle. 

The purpose of the present calculation is to show that 
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[ M -/ , M~ J ] = 0. 

(a) Verify that the light-cone gauge operator M -/ takes the form 



( 1 ) 




( 2 ) 



Set up now the computation of (1) distinguishing the two kinds of terms in (2). 
Calculate the contributions to the commutator from mixed terms and from the last 
term. 

(b) Complete the computation of (1) by finding the contribution from the first term on the 
right-hand side of (2). 

Problem 11.7 Transformations generated by the light-cone gauge Lorentz generators 

M+~ and M~‘ . 

(a) Calculate the commutator of M" 1 (defined in (11.86)) with the light-cone coor- 
dinates x + (r), x _ (t), and x 7 (r). Show that M" 1 generates the expected Lorentz 
transformations of these coordinates. 

(b) Calculate the commutator of M -/ with the light-cone coordinates x + (r), x _ (r), and 
x J (t). Show that M -/ generates the expected Lorentz transformations together with 
a compensating reparameterization of the world-line. Calculate the parameter k for 
this reparameterization. [Hint: the reparameterization takes the “hermiticized” form 



8x^(t) = j(k 3 t x m + 3 r x M k).] 



r 

12 Relativistic quantum open strings 

L 



We finally quantize the relativistic open string. We use the light-cone gauge to 
set up commutation relations and to define a Hamiltonian in the Heisenberg pic- 
ture. We discover an infinite set of creation and annihilation operators, labeled 
by an integer and a transverse vector index. The oscillators corresponding to the 
X~ direction are transverse Virasoro operators. The ambiguities we encounter 
in defining the quantum theory are fixed by requiring that the theory be Lorentz 
invariant. Among these ambiguities, the dimensionality of spacetime is fixed to 
the value 26, and the mass formula is shifted slightly from its classical coun- 
terpart such that the spectrum admits massless photon states. The spectrum also 
contains a tachyon state, which indicates the instability of the D25-brane. 



12.1 Light-cone Hamiltonian and commutators 



We are at long last in a position to quantize the relativistic string. We have acquired con- 
siderable intuition for the dynamics of classical relativistic strings, and we have examined 
in detail how to quantize the simpler, but still nontrivial, relativistic point particle. More- 
over, having taken a brief look into the basics of scalar, electromagnetic, and gravitational 
quantum fields in the light-cone gauge, we will be able to appreciate the implications of 
quantum open string theory. In this chapter we will deal with open strings. We will assume 
throughout the presence of a space-filling D-brane. In the next chapter we will quantize the 
closed string. 

Just as before, we will interpret the classical equations of motion in the light-cone gauge 
as equations for the appropriate Heisenberg operators. It is therefore necessary for us to 
review the results of our light-cone analysis of the classical relativistic string. 

We found a class of world-sheet parameterizations (9.27) for which the equations of motion 
are wave equations X 11 — X 1 ' = 0. This remarkable simplification came at the expense 
of two constraints: (X ± X') 2 — 0. With these constraints, the momentum densities are 
simple derivatives of the coordinates: 

_ \ — x^' V Tll = —^—X 11 . (12.1) 

2 na' 2tux' 

These equations hold in all gauges within the class we considered. In particular, they are 
true in the light-cone gauge. For open strings in the light-cone gauge, we set X + = 2a'p + r 
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and solved for X in terms of the transverse coordinates X 1 . Indeed, using (9.65) with 
yS = 2, we have 



1 1 * / * / il r/ 

x~ = —^^(X'x 1 +X 1 X 1 ). 



2a' 2 p+ 

This gives us an explicit expression for V T ~ : 
1 . Ill 



(12.2) 



2tux' 


X = 

2 jta' 2a' 2 p + 


71 


( V n V rl 


~ 2 P +l 


V ' + (2 Jta')*) 



X r X 

( 27TQ ! ') 2 



(12.3) 



These equations will soon become useful. 



As a first step in defining a quantum theory of the light-cone relativistic string, we must give 
the list of Schrodinger operators. Motivated by the list (1 1.25) of Schrodinger operators for 
the quantum point particle, we choose our r -independent Schrodinger operators to be 



Schrodinger operators: ^ X 1 (a), , V rI (o), P + ^j ■ (12.4) 



The associated Heisenberg operators are then 

Heisenberg operators: ^ X 1 (r, a), x^(r), V T, (r , er), p + (j) V (12.5) 



Because the operators (12.4) have no explicit r dependence, neither do the Heisenberg 
operators (12.5). As in the case of the point particle, we expect xT and p + to be fully 
r -independent Heisenberg operators. 



Now we set up the commutation relations. For the Schrodinger operators X 1 (a) and 
V zI (a) we must face the fact that these operators have cr dependence. It is reasonable 
to demand that such operators fail to commute only if they are at the same point along the 
string. We do not expect (simultaneous) measurements at different points on the string to 
interfere with each other. Therefore we set 



[x 1 (a) ,V zJ {o') 



ii) IJ <$(er — a') . 



( 12 . 6 ) 



Here the delta function is being used to implement the constraint that the commutator must 
vanish when a ^ a' . We had to use a Dirac delta function, as opposed to a Kronecker 
delta, since a is a continuous variable. Equation (12.6) is naturally supplemented with the 
commutation relations 




X J (o') 



[v Tl (o). 



V 



T V')] 



= 0, 



(12.7) 





238 



Relativistic quantum open strings 



and 

[x~,p + ] = -i. (12.8) 

The operators x^ and p + commute with all of the other Schrodinger operators: 

[x-,X 7 (a)] = ,P r/ (<r)] = [p+,Z / (a)] = [ p+ , V rI (a) ] = 0 . (12.9) 

For the associated Heisenberg operators, the only nonvanishing equal-time commutation 
relations are therefore 






= i r\ 1] <5(cr — a ') , 



(12.10) 



as well as 



[x 0 (r) , p + ( r)] = —i . 

All other commutators vanish: 



[x\r, 


cr), X y (r, <t')] = 


[■ P t/ (t,ct),P t7 (tV)] 




[*0 ( r ) ’ X 1 (r, cr)] 




[x 0 (r),'P T/ (r, cr)j 


\ 


p + ( r),X 7 (r, CT)j 


= 1 


p + ( r),'P T/ (r, a)j 



(12.11) 



(12.12) 



We must now invent the Hamiltonian. Our Hamiltonian should generate r translation. From 
our experience with the point particle, we know that p~ generates X + translation. But in 
the light-cone gauge X + = 2u' p + T , so 



9 dX+ 9 „ . , 9 

— = r = 2 a p + . 

9r 9r dX+ 1 dX+ 

It follows that the Hamiltonian that generates change in r is expected to be 



(12.13) 



H = 2a'p + p- = 2ol' P + / daV z ~. (12.14) 

Jo 

This will indeed turn out to be the correct string Hamiltonian. Using (12.3), the Hamilto- 
nian can be written more explicitly as the Heisenberg operator 



H( r) = lux 



'j> ( pT '< 



(r, (j)V zI (r, a) + 



X I \r,a)X I \r , cr) 



(2na') 



i\2 



)■ 



(12.15) 



H must generate quantum equations of motion that are operator versions of the classi- 
cal equations of motion. H is very simple when expressed in terms of the transverse 
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Virasoro modes of Chapter 9. There we saw that / J( | = 2 a 1 p + p (equation (9.78)), so 
(12.14) immediately gives 

H = L£. (12.16) 

This expression for the Hamiltonian is perhaps the most memorable one, though, as we 
will see later on, the true Hamiltonian is slightly different. The operator products W and 
X'X' in (12.15) are actually ambiguous operators and need careful definition. Additionally, 
Lorentz invariance will require the subtraction of a calculable constant from H. 

Now that we have a plausible candidate for the Hamiltonian, we have to derive the equa- 
tions of motion. Any Heisenberg operator f (r, cr) which arises from a time-independent 
Schrodinger operator ^(cr) must satisfy 

i cr) = [£(r, a ) , H(r) ] , (12.17) 

where H(r) is given in (12.15). Since H(r) is built from Heisenberg operators that have 
no explicit time dependence, we can substitute H( r) for f (r, cr) in (12.17). We conclude 
that it is completely time independent: H( r) = H . Furthermore, we can see that xX(t) 
and p + ( r) commute with H. They are therefore time independent operators, so we will 
henceforth denote them by x ( J and /; + . The commutator (12.1 1) then becomes 

[xo,p+] = -i. (12.18) 



The Heisenberg equation of motion for X 1 (r, cr) is 

iX 1 (j, a) = [A / (r, cr) , H( r)] = [ X 7 (r, a ) , not' J da' V rJ (r, a')V rJ (t, 
where we dropped the second term in H since it commutes with X 1 (t, cr): 



\x\r, cr) , X j '(t, a') = \x\r, a ) , X J (r, a') 



= 0 . 



(12.19) 



We also reinserted the time parameter in H( r), choosing a time that gives easily evaluated 
equal-time commutators. Making use of (12.10) we find 



i X ! (r, a) = na' ■ 2 • f da'V lJ (r, a’) i )] ,J S(a — o'). 

Jo 



( 12 . 20 ) 



Performing the integral and cancelling the common factor of i, we find 

X'(x,a) = 2 ttu'V tI (t, a). (12.21) 



Happily, this coincides with the classical equation of motion (12.1). The other equations of 
motion can be checked in a similar fashion. For example, you can calculate V zI and use 
the result to verify that 



X’ - X 1 " = 0 



( 12 . 22 ) 
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is the quantum equation of motion (Problem 12.1). As we turn classical string theory 
into a quantum theory, the classical boundary conditions become operator equations. For 
example, the Neumann boundary conditions 

d a X I (x,o) = 0, a—0,: r, (12.23) 

are taken literally as the condition that the operator 3 0 A r/ (r, a) vanishes at the open string 
endpoints. 



We learned in Chapter 9 that the linear combinations of derivatives (X 1 ± X 1 ') are par- 
ticularly simple and useful. We conclude this section by calculating commutators of these 
derivatives. We begin by using (12.21) to rewrite the commutator (12.10) as 



X 1 (r, cr) , X J (x, a') = 2na' i r] IJ S(a — a') . 



(12.24) 



Taking the a derivative of this equation yields 




= 2 na' i ij IJ 



d , 

—S((J -a), 
do 



(12.25) 



Differentiating [Z / (r, a), X J (x, cr')] =0 with respect to a and a' and recalling that 
[ V zI (r, cr), V tJ (r, cr')] = 0, we find that r and a derivatives of the coordinates separately 
commute among themselves: 



[ X ! \x, cr) , X J \x, o') = [ X ! (x, cr) , X J (x, a') 

')(*, cr')] , 



= 0 . 



Now we examine the commutator 



(12.26) 



[(A / + Z /, )(r,or), (X J +X J ' 

which as a consequence of (12.26) equals 

[ X ! (r, cr) , X J \x, a’) j + [ X r (r, a ) , X J (r, a') 

The second term is given by (12.25). The first term equals 

— a ') , X 1 {x, a) = — (2jxci / )ii] J 1 8(a f — a) = 2na f irj IJ — <5(cr — o') . 

L J da' da 



(12.27) 



(12.28) 



To obtain this result we noted that a a' derivative can be traded for minus a a derivative 
when it acts on a function of (a — a'). Moreover, we used <5(x) = 5(— x). We now see that 
both terms in (12.28) are equal, so 



[ (X ! + X ! '){x, a ) , (X J + X J '){x , a') 



Ana' irj IJ 



d 

da 



S(a — a') . 



(12.29) 



In fact, more generally, we have found that 



[ ( X 1 ± X V )(x, a ) , (X J ± X J ')(x, a') ] 



idTrc/ irj 1 ' 1 



d 

da 



8(a — a ') , 



(12.30) 
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since only cross terms contribute. Finally, 



[ ( X 1 ± X V )(t, a ) , ( X J =f X J ')(r, o') ] = 0 . (12.31) 



Equations (12.30) and (12.31) hold for o, o ’ e [0, i r]. 



12.2 Commutation relations for oscillators 



The commutation relations written so far are delicate to handle because they involve field 
operators and use delta functions. They are an infinite set of relations which hold for con- 
tinuous values of a and o'. It is therefore useful to recast them in discrete form, namely, 
as a denumerable set of commutation relations. For this purpose we will examine the 
mode expansions of Section 9.4. These followed from the classical wave equations and the 
boundary conditions for a space-filling D-brane. Since the wave equations and the bound- 
ary conditions continue to hold in the quantum theory, we can use the mode expansions 
in the quantum theory. The classical modes a 7 , however, become quantum operators with 
nontrivial commutation relations. 



Recall our solution (9.69) to the wave equation with Neumann boundary conditions: 



X 1 (r, a) — x q + 7 2a' Kq r + iV , 2ot' ^ -a 7 



cos no e 



(12.32) 



n^O 



In addition, from (9.74) we have 

(X 1 + X 7 ')(t, o) = 72a 7 Y a h e“"' (r+cr) , o e[ 0, tt] , 

neJj 

(. X 1 - X ! ')(r, o) = 72a 7 Y a h e ~ in(T ~ a) , a e [0, it] . 

neJj 



(12.33) 



The above equalities hold for a e [0, tv] because the open string coordinates are only 
defined for o e [0, re]. We will now construct a function of o with period 2 tt which is 
naturally expressed in terms of open string coordinates. To do this, we evaluate the second 
equation above at —o: 

(X 1 -X i ')(t,-o) = VM Y a h e ~ in(X+a) ’ a e [ _7r ’ 0 ]- (12.34) 

ne Z 



The range a £ [—7 r, 0] is required because otherwise the left-hand side is not defined. We 
now define the operator A 1 ( r, o) by 

A 7 (t, a) = 72 o' Y^^n e~" ,(z+a ^ , A 1 ( r, a + 2: r) = A 1 (r, o) . 

ne Z 



(12.35) 
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The indicated periodicity is a direct consequence of the definition. It is now simple to relate 
A 1 to open string coordinates over the length- 2 tt interval a e [— tt, tt], For a e [0, 7r] we 
use the top equation in (12.33) and for cr e [— n , 0] we use (12.34): 

( X 7 + X I ')(x, a) a e [ 0 , tt ] 

(X 1 — X ! ')(x, —a) a e [— tt, 0] . 



A 1 (r, cr) = 



(12.36) 



The operator A 1 will be useful to determine the commutation relations for the a 7 oscilla- 
tors. For this, we must compute the commutator [A 1 (r, a), A J (x, cr')]. Given (12.36) the 
evaluation for the full range cr, a' e [— n, tt] requires four computations: 

[ (X 1 + X V )(x, cr ) , ( X J + X J ') ( r , cr ') ] , cr , cr ' e [ 0 , tt], 

[ (X 1 + X 7 ')(r, cr) , ( X J - X J ')(r, -a') ] . a e (0, Tt], a' e [-tt, 0], 

[ (X 1 - X 1 ')^, -a) , ( X ] + X J ')(r, a')], a e [-tt, 0], cr' e [0, tt], 

[(X 7 -X ! ')(r, -a), (X J -X JI )(x, -a')]. a,a’e[-TT, 0] . (12.37) 

The first commutator, for cr, a' e [0, n], is simply read from (12.30): 

[ (X 1 + X 7 ')(r, a) , (X J + X JI ) (r, a') 1 = 47ra' i r) lJ —S(c r - cr'). (12.38) 

L J da 

The last commutator, for a, a' e [—tt, 0], is also obtained from (12.30): 

d 



[(X 1 - X V )(t, -a) , ( X J - X J '){x, -a 1 ) ] = -4rra' ir 1 IJ 



d(—a ) 



8(—a + cr') 



= 47ra' ir] IJ — 8(a — a'), 
da 

and coincides with the result (12.38) of the first commutator. For the second and third 
commutators in (12.2) we can use (12.31) to conclude that both of them vanish. In fact, the 
right-hand side of (12.38) also vanishes since in those cases a and a' cannot be equal. It 
follows that the result of all four commutators can be summarized as 

[A 1 {x,a), A J (r,a')] = Attoi' — S(a-a'), a, a' € [—tt, tt], (12.39) 

L J da 

Using (12.35) and cancelling a common factor of 2a', the above result gives 



E « 

m'.n'eZ 



—im' (r+cr) e ~i 



m'(T+a') | ’qT, i a J r = 2: X ir] 1 J ~^—8(a - a'). 



(12.40) 



This equation holds for a, cr' e [— tt, tt]. In order to extract information we apply on both 
sides the integral operations 



1 r 2n . 1 r 2n 

da e"" a ■ — / , 

2tt Jo 2 tt Jq 



da e u 



(12.41) 



On the left-hand side of (12.40) the integrals pick the term with = m and n' = n: 



e 



—i (m+n) 



T [«», - «« 



(12.42) 
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On the right-hand side of (12.40) the integrals give 



1 r2n j 

ir ) 11 — / doe ima —\ da' e" m ' S(a — o') 
2 7T Jo da Jo 



= ip IJ — / do e ima —e ina = -n r] IJ — 
2 tt Jo da 2 7t 

— n tj .0 — rrr tj ^m+n,0* 



1 f 27T 

- / doe i(m+n)a (12.43) 
■x Jo 



Equating our results (12.42) and (12.43), we find 
[4 , a,f ] = m rj IJ 8 m+n fi e' 



m i ] 8 m -\-n, o> 



(12.44) 



since the Kronecker delta can be used to set m = —n. Therefore, the commutation 
relation is 



«m - <4 



// o 

— 171 Tj 0/724-/2,0* 



(12.45) 



This is the fundamental commutation relation between a modes. Note that otq commutes 
with all other oscillators. This is quite reasonable: as shown in (9.52), ctq is proportional to 
the momentum of the string 



q!q = V2 a' p 1 , 



(12.46) 



and it is expected to have a nontrivial commutator with xjf only. 

To complete the list of all possible commutators we must find the commutators between 
xl and the oscillators crj . For this, we consider equation (12.24), and integrate both sides of 
the equation over a e [0, jr], On the left-hand side, the terms with oscillators in X 1 (r, a) 
give no contribution, and on the right-hand side the delta function disappears giving a factor 
of one: 

[^ + V2a 7 a^r, = 2a'irj IJ . (12.47) 

Since X J is a sum of terms that contain aj , we have [a^ , X r \ = 0. Additionally, using the 
mode expansion of X J , equation (12.47) becomes 

cosn'a' e~' n T = s/2od i rj IJ . (12.48) 

n'eZ 



Reorganizing the left-hand side of this equation we find 



CXJ 

[4 - “0 ] + J2 [4 . a n' e ~ ln T + a -n' e " 1T 



COS 



no = V 2a' i rj 



I] 



(12.49) 



n'= 1 
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We apply to both sides of this equation the integral operation — f? da cos na, with n > 1. 
We then find that 



cde~ inr 



■«V nT 



= 0 , 



(12.50) 



or equivalently 

[x',a J n ]e- in ' + [x',a J _ n ]e in * = 0 . (12.51) 



Since the left-hand side must vanish for all values of r, each term must vanish separately 
(prove this!). It follows that 



[*0 ] = 



= 0 for n ^ 0 . 



(12.52) 



Additionally, equation (12.49) gives 



./ 



4 o- «o 




(12.53) 



This, together with (12.46), gives the expected commutator 




As in familiar quantum mechanics, the operators and p 1 are Hermitian: 

(*o) t= *o. (/) t = /- (12.55) 

The calculations we performed to obtain the commutation relations took quite a few 
steps, which we explained in detail. When we discuss closed strings, or open strings on 
general D-brane configurations, similar computations will be required. We will be able to 
carry them out using, with minimal modifications, the calculations we just did. 

It is useful at this point to examine in detail the commutation relations (12.45) for the a ] n 
modes. As we will show below, they are equivalent to those of an infinite set of creation and 
annihilation operators. To see this, we begin by defining oscillators, taking our inspiration 
from the classical variables introduced in (9.53): 

a iff = a% *Jn , a^ n =a%*^fn, n> 1. (12.56) 

In these equations, both the a and the a are classical variables. Now they become operators. 
Classical variables that are complex conjugates of each other become operators that are 
Hermitian conjugates of each other in the quantum theory. We can therefore preserve the 
first of the above definitions, but the second must be changed. For our light-cone modes 
p — I we take 

= aj : ~Jn and a!_ n = aj , ' J~n , n > 1 . 



(12.57) 
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Note that, with this definition, 

(«')*=«!„, neZ. (12.58) 

This equation holds for n = 0 because , being proportional to p 1 , is also Hermitian. It 
is useful to emphasize that, while the a ! n modes are defined for all integers n, the a l and 
ah 1 operators are only defined for positive n . 

An important consequence of the above Hermiticity properties is that X 1 (r, er), which 
used to be real in the classical theory, is now a Hermitian operator. 

Quick calculation 12.1 Use the expansion (12.32) and the Hermiticity conditions (12.55) 
and (12.58) to show that 

(X / (T,cr)) t = X 7 (r,cr). (12.59) 

The i factor in front of the sum in (12.32) is needed for this calculation to work out. 



We can now rephrase the commutation relation for the a modes in terms of the oscillators 
(a,{ , a!, ^). For this purpose, rewrite (12.45) as 



[«i , a -n \ = m ^nW J ■ 



(12.60) 



When m and n are integers of opposite signs the right-hand side vanishes, and the two 
operators in the commutator have mode numbers of the same sign. Therefore, we learn that 



- I ^ « 7t 

» 1/1 n 



= o. 



= [ a l 

If both m and n are positive in (12.60) we find 

[ Vm u' m , Jn af t 1 j = mS nlM rj IJ . 
Moving the square roots to the right-hand side 



[aL 



: .a 11 



1J 



(12.61) 



(12.62) 



(12.63) 



Since the right-hand side vanishes unless m = n, it simplifies to 



[ain, =K, n ri IJ . (12.64) 

This, together with (12.61), shows that (al n , a 2) satisfy the commutation relations of 
the canonical annihilation and creation operators of a quantum simple harmonic oscilla- 
tor. There is a pair of creation and annihilation operators for each value m > 1 of the 
mode number and for each transverse light-cone direction 1. The commutation relations 
are diagonal: oscillators corresponding to different mode numbers, or to different light- 
cone coordinates, commute. If the mode numbers and the coordinate labels agree, the 
commutator is equal to one. In terms of the a operators, with n > 1 : 
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a J t are annihilation operators, 
ot!_ n are creation operators (n > 1). 



(12.65) 



For future reference let us rewrite the expansion of X^T.a) in (12.32) in terms of 
creation and annihilation operators. Separating out the sum over all integers into sums over 
positive and negative integers, and using (12.46), we find 

OO 

X 1 (r, cr) = Xq + 2a' p 1 r + i V 2a' e~ ,nT — a!_ n e lnT ^j — — . (12.66) 

71=1 

Replacing a modes by the corresponding oscillators we obtain 

CO 

X 1 (r, cr) = Xq + 2a' p 1 r + iV2a' J e~ ,nT — aj^ e‘" T ^ — j=— . (12.67) 

71 = 1 

This is the expansion of the coordinate operator in terms of creation and annihilation oper- 
ators. 

Let us take stock of what we have learned. The list of operators we started with was given 
in (12.5). We have seen that the operators X 1 (r, cr) and V xI (r, cr) can be traded for an 
infinite collection of oscillators, plus pairs of zero modes (x^, p 1 ). Since the other two 
operators in the list, xT and p + , are also zero modes, the full set of basic operators of 
string theory is a collection of zero modes plus an infinite set of creation and annihilation 
operators. This result is so important that we will now derive it in a different way, showing 
explicitly how the quantum simple harmonic oscillators arise. 



12.3 Strings as harmonic oscillators 



Our aim here is to give a more physical derivation of the results obtained in the previous 
section. In particular, we will rederive the mode expansion (12.67) and the commutation 
relations between the operators in that expansion. These results followed from the fun- 
damental commutation relation (12.10) together with the operator equations of motion 
(12.22) and the operator boundary conditions (12.23). Of these, the commutation relations 
(12.10) are perhaps the least intuitive, as they involve a delta function. In the derivation 
below there will be no delta function. 

Here is our strategy. We will invent a simple Lagrangian that describes the dynamics of 
the light-cone coordinates X 1 . This is not such a difficult task since we know the equa- 
tions of motion of the X 1 , their boundary conditions, and the definition of the canonical 
momenta V zI . Then we will expand the coordinate X 1 (r, a) as a function of a but with r- 
dependent expansion coefficients. Using the Lagrangian, we will show that those expansion 
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coefficients are actually the coordinates of harmonic oscillators that have ever-increasing 
energy! We will conclude by relating these oscillators to the creation and annihilation 
operators obtained in the previous analysis. 

To set up the notation, we begin by reviewing the basic properties of the quantum har- 
monic oscillator. Let q n (f) be the coordinate of a classical simple harmonic oscillator, and 
let the action be given by 

Sn = J L n (t ) dt = J dt Q- q 2 n {t ) - £ ^(f)) • (12.68) 

We recognize this as a harmonic oscillator because the kinetic energy is proportional to the 
velocity-squared, and the potential energy is proportional to the coordinate-squared. For 
this Lagrangian, the momentum p n conjugate to the coordinate q n is 

dL 1 

Pn = = - q,i ■ (12.69) 

dq n n 

A little calculation now gives the Hamiltonian as 

Hn(.Pn i qn) — Pn qn L n — — (p n + q n ) ■ (12.70) 

In this equation, n plays the role of the frequency co of the harmonic oscillator. To define 
the quantum oscillator, we introduce Schrodinger operators q n and p n , with the canonical 
commutation relation 

[q n , Pn] = i ■ (12.71) 



Creation and annihilation operators can be introduced as 



d n 



1 

n 



(. Pn - iq n ) , 



at 



1 

7 ! 



(Pn + iqn) ■ 



(12.72) 



You should check that as a consequence of (12.71) the creation and annihilation operators 
satisfy the commutation relation 

[«„ , cQ = 1 . (12.73) 



Inverting the relations in (12.72), we find 



l | la. 

qn = ~ > Pn = + a n ) ■ 



(12.74) 



These can be used to rewrite the Hamiltonian H n in terms of the creation and annihilation 
operators. We find the familiar result 



H n = n(^ata n + . 



(12.75) 



We can now consider the Heisenberg operators (a„(f), a\(t)) that are associated with the 
Schrodinger operators (a n . at). As emphasized in Section 11.2, the Heisenberg operators 
satisfy the same commutation relations as the Schrodinger operators: 

[a„(f),al(0] = 1 ■ 



(12.76) 
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The Heisenberg equation of motion for a n {t) is 

a n (?) = i [ H n (?) , a n ( ? ) ] = in [ a\ ( t)a„ ( ? ) , a n (?) 
This differential equation is solved by 



-in a„(t ) . 



a„ (?) 



e ln 'a n (0 ) = e 



(12.77) 



(12.78) 



where a n is the constant Heisenberg operator that equals a„ (?) at ? = 0. A similar 
calculation gives 

a «(?) = e ' n ' a i(0) = e ''" a l ■ ( 12 . 79 ) 

As you can see, the angular frequency of oscillation is indeed equal to n. Finally, with these 
results and (12.74), we can find the explicit time dependence of the operator q n (t)\ 

<?,.(?) = (a„(?) - al(t) ) = (a n e~ int - a\ e int ) . ( 12 . 80 ) 

This concludes our review of the quantum simple harmonic oscillator. 



We now turn to the discussion of an action that encodes the dynamics of the transverse 
light-cone coordinates X 1 (r, er). We claim that the action is simply given by 



S = J dTda C = J dr da(± ! X 1 - X 1 ' X 1 '^ . (12.81) 

This action is much simpler than the Nambu-Goto action; it has no square root, for 
example. The first term, which contains time derivatives, represents kinetic energy. The 
second term, which contains spatial derivatives, represents potential energy. The canonical 
momentum associated with X 1 coincides with the momentum density V tI : 



dC 
dX 1 



2jta' 



-X 1 = V Tl 



(12.82) 



as we see comparing with (12.1). This confirms that C is correctly normalized. The 
equations of motion for X 1 follow by variation: 



SS= 2 tuS I dX Jo “ M^ 7 )^') . (12.83) 



Restricting ourselves to variations where the initial and final positions are fixed, we can 
drop the total r derivatives and find 



ss =-i^f di [ m '(*' - *"')] 



(12.84) 



It is clear that the requirement that the action be stationary gives us both the wave equation 
(12.22) for the coordinates and the boundary conditions at the string endpoints. As a final 
check of the consistency of the action we calculate the Hamiltonian: 



H= f doH = 
Jo 




rl ± , 



C ) 



(12.85) 



249 



12.3 Strings as harmonic oscillators 



Writing the r derivative of X 1 in terms of T rI we find 

H=f d(j(jTa'V T, V Tl + — — X I 'x I '\. (12.86) 

Jo ' 47r a' / 

This Hamiltonian coincides with the one we postulated and tested in Section 12.1. 

Let us now use the action (12.81) to quantize the theory. For this purpose we replace 
the dynamical variable X 1 (r, a) by a collection of dynamical variables that have no a 
dependence. This is done by writing the expansion 



X 1 {t, a) = q\ r) + 2 Vo 7 ^^(r) 



cos no 



77 = 1 



(12.87) 



This is the most general expression that satisfies Neumann boundary conditions at the 
endpoints. The particular normalization used to introduce the expansion coefficients was 
chosen for convenience. 

Our next step is to evaluate the action (12.81) using the above expansion for X 1 (r, a). 
For this we use 



X 1 



X 



1 ' 



q I (x) + 2yfu i ]T^(r) 



cos na 



n = 1 



oo 

— 2VcF ^<7 ) {(T)v / n sinner . 

n = 1 



( 12 . 88 ) 



The evaluation of the action S using the above expansions is quite straightforward because 
the a integrals of (cos ner cos mo) and (sin na sin mo) vanish unless n — m. We find. 



5 = 



/ l ^ | ^ 

dT [^ 0l f^ I ^^ I( ' T) + ®»( T) «»( T ) - \ q n^ q n^y) ■ 

n = 1 



(12.89) 



Quick calculation 12.2 Prove equation (12.89). 

Comparing the action (12.89) with the one recorded in (12.68) we see that the qf t (r), with 
n > 1, are the coordinates of simple harmonic oscillators. The frequency of oscillation 
of ql t ( r) is n. This is the physical interpretation of the expansion coefficients in (12.87). 
Since the action for qj t (r) coincides exactly with the action S n , no new work is necessary 
to compute the Hamiltonian, except for the zero mode q 1 : 

P 1 = ^j = ^-,q 1 and [q 1 , p J ] = ir, IJ . (12.90) 

aq 1 2a ' 

The Hamiltonian is then given by 

oo 

H = a P 1 p 1 + J2 \ (p ! n Pn + <ln d'n) > (12.91) 

71=1 ^ 
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where we used (12.70) to write the part of the Hamiltonian that arises from the oscillators. 
The earlier analysis of the Heisenberg operator q n (f) led to the solution (12.80). This means 
that for the qj t (r) oscillators we have 

q!i(r ) = -^( fln / e -'"- fl „ / V" r ), (12.92) 

where (a^, a n z ) are canonically normalized annihilation and creation operators. For the 
Heisenberg operator q\ r) we find 

q\ r) = i [H,q'( r)] = ia'[p J p J (r) , c/(r)] = 2a' p 1 (r) . (12.93) 

Note that p l is a r-independent Heisenberg operator. We solve this differential equation 
for q 1 (r) by writing 

q 1 (r) — Xq + 2a' p 1 r . (12.94) 

Here is a constant operator that on account of (12.90) satisfies [xq , p J ] = iq IJ . Finally, 
we can substitute our solutions (12.92) and (12.94) into the expansion (12.87) for X 1 to find 

OO 

X 1 (r, a) = xk + 2a' p 1 x + i V 2a' (a! e~ ,nz — a[. ' e ,nz \ — — , (12.95) 

in exact agreement with the previously derived (12.67). We have therefore given a physical 
derivation of the mode expansion and commutation relations. We identified the classical 
variables that become oscillators, and we did not have to use delta functions. Having done 
so in this case, when we quantize other string configurations we will simply use the abstract 
approach of the previous section. It gives a direct and quick route to the desired answers. 



12.4 Transverse Virasoro operators 



We have written mode expansions for the transverse coordinates X 1 (r, a) and we have 
seen quite explicitly the connection to harmonic oscillators. How about the other light-cone 
coordinates, X + {x, a) and X~(x , er)? The expansion of X + is truly simple: 

X + (r,a) = 2a'p+ r = V^cFa+r. (12.96) 

As discussed for the classical case in Section 9.5, this means that we are setting 

x+ = 0, a+ = 0, n/: 0. (12.97) 

For the X~ coordinate a mode expansion was provided by (9.72): 

X~(r, a) = Xq + V 2 a' a q r + i V 2a' ^ -a~e~ ,nz 

o n 



cos no . 



(12.98) 
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Moreover, we used the constraints to solve for X~ in terms of X 1 , p + , and a constant of 
integration xC . This meant that the a~ modes could be written in terms of the modes, 
as shown in equation (9.77): 

. ( 12 - 99 ) 

where 

= 7 E a n-p a p ■ ( 12 - 10 °1 

peZ 



The repeated index I is summed over the transverse light-cone directions. In Chapter 9 the 
L- j- were called transverse Virasoro modes. Having seen that the a modes become opera- 
tors, the L,j[ will now be called transverse Virasoro operators. The steps that led to (12.100) 
remain valid in the quantum theory, except for the fact that the a modes were treated as 
commuting classical variables. We now know that the a operators do not commute. We 
must therefore question whether the ordering of the two a operators appearing in ( 12.100) 
is the correct one. A better question is whether the ordering matters. Since two a operators 
fail to commute only when their mode numbers add up to zero, the two operators in L^ 
fail to commute only when n — 0. So L {] is the only ambiguous operator. 



There is plenty at stake in ordering correctly. The operator L ( | is in fact the light-cone 
Hamiltonian, as we showed in equation (12.16). Moreover, we saw at the end of Chapter 9 
that Lq enters directly into the calculation of the mass of string states. We also mentioned 
at that time that the quantum theory would bring a subtlety into the calculation of the mass. 
Well, the subtlety has arrived: we must define the quantum operator L t \ ! Let us therefore 
look at Lq in more detail: 



l o = *E a -P a P = l a o a o + \ E 



-P a p 



E a P a -P- 



(12.101) 



p = 1 



p = 1 



The first sum on the right-hand side is normal-ordered: annihilation operators appear to 
the right of the creation operators. It is useful to work with normal-ordered operators since 
they act in a simple manner on the vacuum state. We cannot use operators that do not have 
a well defined action on the vacuum state. Since the last sum on the right-hand side of 
(12.101) is not a normal-ordered operator, we rewrite it as 




P = l p = 1 



1 OO 1 oo 

= 2E“-/-“f + 2 (D_2) E^ 

p = t p = i 



( 12 . 102 ) 
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If you look at the last term of the above equation you will note that it is divergent; it 
involves the sum of all positive integers! This is clearly problematic. How do we deal with 
this? One option is simply to ignore this difficulty, claiming that it is really up to us how we 
define L^r . There is a kernel of truth to this option, but it is not completely correct. Adding 
a constant to L' } changes the values of the masses of the string states, and if anything, the 
above computation has alerted us to the fact that this additive constant could be nonzero, 
or even infinite. Taken at face value, the above computation gives 

1 OO 1 oo 

4 = 2«0«0 + E a -P a P + 2 {D - 2) E P ■ (12 - 103) 

p = i p = i 

The operator Lq enters into our computation of the mass via the definition of p~ . From 
(12.99), with n = 0, 

= 2a' p~ = . ( 12 . 104 ) 

This suggests a strategy. First, we define, once and for all, L ( \" to be the normal-ordered 
operator in (12.103) without including the ordering constant: 



1 oo 

= -T“o«o "t" 'y 


OO 




= up 1 p 1 + J2p a p ] a p ■ 


(12.105) 


p= i 


p= i 





Note that L {) is Hermitian: (Lq )^ = L {) . Second, we introduce an ordering constant a into 
(12.104): 




If we took seriously our attempt to order , we would have to conclude that 

1 OO 

a = -(D-2)J2 p- 



p= 1 



(12.106) 



(12.107) 



We will discuss below one remarkable interpretation of this equation which does, in fact, 
give the correct result. More pragmatically, we will take a to be an undetermined constant. 
As we will show in Section 12.5, the quantum consistency of string theory fixes the con- 
stant a to an interesting finite value. Before proceeding further, let us investigate how the 
inclusion of a modifies the computation of the mass-squared operator. Working from the 
definition M 2 = —p 2 , and using (12.105) and (12.106), we find 

M 2 = -p 2 = 2 p + p~ - p 1 p 1 = ^(Tq + a ) - p 1 p 1 

a ' 

1 oo 

= — (a + E na n a n) • (12.108) 

n = 1 



As expected, a introduces a constant shift into the mass-squared operator. 
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It is impossible to resist the temptation to interpret (12.107). An important result in mathe- 
matics suggests a finite value for the right-hand side. For this we consider the zeta function 
£(s), which is defined as the infinite sum 

oo i 

£(*) = E-’ ^)> 1 - ( 12 - 109 ) 

n s 

n=\ 

The argument s of the zeta function is assumed to be a complex number, but as indicated, 
the above sum only converges if the real part of the argument is greater than one. We can 
use analytic continuation to define the zeta function for all possible values of the argument. 
£(s) turns out to be finite for all values of .v except s = 1. In particular, as you will see in 
Problem 12.4, £(— 1) = —1/12. On account of (12.109) this suggests that 

f(-l) = --I = l+2 + 3 + 4+-... (12.110) 

This is a surprising interpretation for the infinite sum Yl°p= l P- Not only is the result finite, 
but it is also negative! Substituting back in (12.107), it gives us 

« = -^CD-2). (12.111) 

This is actually the correct value of a, as we will explain in Section 12.5. The inspired 
guess gave the right answer. We will also see that consistency requires D — 26, so that, 
in fact, a = — 1 . This value for the shift in the mass-squared operator is precisely what is 
needed for the spectrum of open strings to include massless photon states! 



Having discussed L {] in detail, let us consider the other transverse Virasoro operators. 
Since («^)^ = a J _ n , we may expect that (a~)^ = aZ„ , or equivalently, on account of 



(12.99), that 



(K) 






( 12 . 112 ) 



We have already verified this equation for n — 0. For n 7 0, we can easily prove this 
Hermiticity property using (12.100): 



( L n 7 J2 (a ^~P a P ) 



t _ 






+p ' 



(12.113) 



peZ 



Since the oscillators in each term of the sum commute, we can exchange them. By also 
letting p —> —p, we get the expected result: 

^ = l -Y.° l -n-pK = L -n- ( 12 ' 114 > 

peZ 



Perhaps the most interesting property of the Virasoro operators is that they do not com- 
mute. We have seen that a ] m and a' n commute except when m + n equals zero. This is not 
the case with the a~ modes. Two Virasoro operators and never commute when 
m 7 n. The commutation properties of Virasoro operators are a bit intricate, so we will 
consider them in steps of increasing generality. 
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As a warmup, let us consider the commutator between a Virasoro operator and an oscillator 
a J n . We have 



[ 



a J 

» u n 



\ \ a in-p a p ■ a n 



\ J2{ a ™-P \ a P > “»] + \ a m-p 

peT, 




(12.115) 

Evaluating the commutators and recalling that r\ 1] = S I J , we find 



[ 



L l a J 

L, m » 



^ “ P)8m-p+n,oa J p ') ■ 

peZ 



(12.116) 



Because of the Kronecker deltas only one term contributes in each sum: p — —n in the 
first and p = m + n in the second. We thus find 






m+n 



m+n 



)■ 



(12.117) 



Our final result is therefore 




The mode number on the right-hand side is the sum of the mode numbers on the left- 
hand side. This could not be otherwise, since the basic commutator of a operators trades 
two operators with opposite mode number for a constant, and in doing so, the total mode 
number is conserved. Moreover, the spatial index on the oscillator is preserved. Equation 
(12.118) holds for all values of m, including m = 0. Indeed, (12.100), which we used, 
gives Lq up to a constant that, although infinite, cannot affect the commutator. It is worth 
anyway to check the result directly. 



Quick calculation 12.3 Calculate [Lq , ] using (12.105) and confirm that (12.1 18) holds 

for m = 0. 



Quick calculation 12.4 Show that 



[ 





—i'Jla' a f m . 



(12.119) 



Let us now consider the commutator of two Virasoro operators and L„. The com- 
putation is a bit subtle, and it is easy to get incorrect answers. We will avoid subtleties by 
checking, at every stage of the calculation, that our expressions are normal ordered. For 
this, we begin by rewriting the Visaroso operator (12.100) in such a way that it gives the 
correct result even for Lq. For this, the sum is split as: 

a m—k a k ^ } , a k a m—k • 

Z k > 0 Z k < 0 



( 12 . 120 ) 
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For any value of m the above right-hand side is normal-ordered: in the hist sum the a to 
the right is an annihilation operator (or a zero mode) and in the second sum the a to the 
left is a creation operator. We now evaluate 

[ L nr L n] = \ 0“™-*“* ’ L " ] + \ ■ L n] . 

Z k > 0 Z k < 0 

Z *>0 Z rt <0 

+ - L n] + \ K ’ L « • (12.121) 

Z t>0 Z i<0 

Evaluating the commutators we get 

i L i’ L n] = \ _ k ) aI m+n-k a k + \ ~ k ) a Wm+n-k 

Z k > 0 Z k< 0 

+ lj2 ka m-k a l+n + lj2 k0l k+n a !n-k ■ ( 12 . 122 ) 

z t>0 z t<0 

The terms on the first line of the right-hand side are always normal ordered. The terms 
on the second line can require ordering, depending on the values of m and n. Let us now 
consider two cases: m + n ? 0 and m + n — 0. 

Case m+n f- 0. All pairs of oscillators on the right-hand side of (12.122) commute so we 
find 

\ k m ’ k n ] 2 ^ — k ) a m+n—k a k + 2 ^ ! k a m—k a k+n ’ 

= ^ - k ^ a m+n-k a l + ^ ~ n) a m+n-k a lc - 

Z keZ Z £eZ 

= {m a m+n-k a l ■ (12.123) 

JreZ 

In passing from the hrst to the second line we let k —> k — n in the second sum. Since 
m + «/: 0 the hnal operator requires no ordering and is recognized to be L^ +n . We have 
therefore shown that 

[ Li ] = (m - n)Li +n , m+n? 0 . (12.124) 

The commutator of two Virasoro operators is a Virasoro operator with mode number equal 
to the sum of the mode numbers of the operators which enter the commutator. The above 
result does not hold when m+n — 0, in which case the answer is different. Nevertheless, 
as a mathematical construct, a set of operators L^ with n e Z, satisfying (12.124) for all m 
and n, defines an interesting Lie algebra (Problem 12.5). This algebra is called the Virasoro 
algebra without central extension or the Witt algebra. 
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Case m+n =0. In this case we write n — —m and equation (12.122) becomes 

1 OO j 

[ L m> L -m\ = 9 E ( '” ~ *>“-*“* + ? E ( ™ _ *)“*“-* 

Z k = 0 Z k < 0 

1 oo ^ 

+ 9 E k a m-k a Lm + 9 E k a k-m aI m-k ■ (12. 125) 

Z k=0 Z k < 0 

In order to compare the various terms it is useful to relabel the summation variables so that 
the rightmost oscillator is always a 1 /. . For this we let k — > —k in the second term of the first 
line, k —> m + A: in the first term of the second line, and k —> m — k in the second term of 
the second line: 

1 OO J OO 

[ L m- L -m ] = 9 E (m ~ *)“-*“* + 9 + k ) a -k a k 



k=0 



k=l 



J OO 1 oo 

- {m + k ) + 9 E {m ~ k ) a -k a k ■ 



(12.126) 



k=—m 



k=m J r 1 



We now assume, without loss of generality, that m > 0. It then follows that all terms are 
normal-ordered except those in the underlined summand for which —m < k < 0. Splitting 
the sum, the underlined term becomes equal to 



-k u k 



\ EE ~ v a k a -k + 2 E (m + ® a -k 

z k = 0 Z k= 1 

j m ^ m | oo 

= 2 E (,w - [“*>“-*] + 2 E (w - ^ a -k a k + 2 E (,w + ^ a -k a k ■ 

z i=0 Z *=0 Z fc=l 

Evaluating the commutator and substituting the result back into (12.126) we get 

OO OO 

[L„, = YX m ~ k ^ a -k a k + E> + k ) a -k a k + ( D - 2 )A(m) , (12.127) 

7fc=0 fe=i 

where A ( 777 ) is the constant 

j m j m 1 m 

A ( 777 ) = - k(m — k) = -777 E*-aE k 2 . (12.128) 

Z k=0 k= 1 Z fc=l 

To evaluate A ( 777 ) we need the following result: 

Quick calculation 12.5 Use mathematical induction to prove that 

m i 

= -(2777 3 + 3777 2 + 777). (12.129) 

' 6 
fc=i 

Making use of (12.129) we find 

A (77i) = —777 2 (777 + 1) — — (2777 3 + 3m 2 + 777 ) = — (777 3 — 77i) . 



(12.130) 
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Expanding out the sums in (12.127) and substituting the value of Aim), we find 



[ L m^ L -m\ =2m( 




k = 0 



We recognize that the operator in parenthesis is Lq . Therefore, our final result is 



This completes the computation for the case m + n = 0. 

The general result for the commutator of two Virasoro operators is obtained by writing a 
formula that gives (12.124) for m + n ^ 0 and gives (12.132) for m + n — 0. This is easily 
done and the result is 



A set of operators L; j-, with neZ, satisfying (12.133) defines the centrally extended Vira- 
soro algebra. The second term on the above right-hand side is called the central extension. 
It is a constant or, more properly, a constant times the identity operator (the operator that 
acting on any state gives back the state). This term is said to be central because it com- 
mutes with all other operators in the algebra. The central term vanishes for m = 0 and for 
m — ±1. There is therefore no central term in the commutator [L\, T_i]. The Virasoro 
algebra is perhaps the most important algebra in string theory. In the light-cone quantiza- 
tion of string theory - our subject in this chapter - the Virasoro operators enter into the 
definition of the Lorentz generators, as we will see in Section 12.5. 

We conclude this section by studying how the Virasoro operators act on the string coor- 
dinates. Since quantum operators act via commutators, we must find the commutator of 
a Virasoro operator with the coordinate operator X 1 (r, er). We will see that the Virasoro 
operators generate reparameterizations of the world-sheet. 

Making use of the coordinate expansion (12.32) we find 




(12.132) 




(12.133) 



[ L^ n , X 1 (r, o) ] = [L^, Xq] + iV2a> 



1 

— cos yio e 
n 



—inz 




n/0 




72/0 



(12.134) 
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where we used (12.118) and (12.119) to evaluate the commutators. The right-hand side 
above can be written as a single sum: 

\- L m^x\x , cr) ] = -zV 2a 7 ^2 cos na e~ lnr al n+tl 

= -,V 2^ J £( e ~‘ n(Z ~ a) + e- in(z+a) ) «' +fI 

neZ 

= -is/w - J2(e- iin - m)(T ~ cf) + g-itn-mXr-HT))^ > 

2 neZ 



where in the last step we let n -* n — m. Finally, 

[ , X 1 (x, a) ] = - e im(T - a) VW J2 e~ in(T ~ a) ^ 

2 „eZ 

_ i e im(r+<r)^2^ e~ in(z+cr) aj, . 
2 neZ 



To interpret this result it is necessary to express the right-hand side in terms of derivatives 
of the string coordinates. This is done with the help of (12.33): 



[L^,Z 7 (t,ct)] = - l - e lm{x - a) ( X 1 - X 1 ’) - l - e imi - r+a '> {X 1 + X 1 ') 



-ie lmr cos mo X 1 + e lmz sin mo X 1 



(12.135) 



This equation has taken the form 

[ Li , X\x, o) ] = ^ X 1 + %° n X V , (12.136) 

where 

%i(x, o) = —ie' mx cos mo , 

$°(x ,er) = e imz sinmcr . (12.137) 



The interpretation of (12.136), we claim, is that the Virasoro operators generate reparame- 
terizations of the world-sheet. In particular, they change the r and a coordinates as 

r — * x + e^(t, a), 

o — * ff + e!£(r,tr), (12.138) 



where e is an infinitesimal parameter. In order to see this, note that Taylor expansion gives 

X'(x + e ^ , tr + e C) = ^ e + C X 1 ') 



= X 1 (r, ct) + e 






r, cr) 



(12.139) 



This equation states that the action of the Virasoro operators on the string coordinates 
generates the same change that would occur as a result of a reparameterization of the 
world-sheet. This is what we wanted to show. 
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What is the reparameterization generated by ? Setting m = 0 in (12.137), we find 
= —i and ^ = 0. As a result, (12.136) gives 

[L^X 1 ] = -idrX 1 , (12.140) 

which we recognize as the Heisenberg equation of motion for X 1 . Indeed, L g is, up to 
an additive constant, the string Hamiltonian, and as such, it must generate time transla- 
tions. It is also interesting to note that for all m, vanishes at a — 0 and at a — tc . This 
means that the reparameterizations generated by the Virasoro operators do not change the 
a coordinates of the string endpoints. The range of a remains [0. jt]. 

The functions and in (12.137) are not real, and used in (12.138) they spoil the 
reality of the coordinates r and a. This complication is familiar to you from quantum 
mechanics. Real transformations are generated by anti-Hermitian operators. The momen- 
tum operator p = — ;'V is Hermitian, and therefore it is the anti-Hermitian combination 
ip = X that generates real translations. Out of the operators and L \_ m , we can generate 
two anti-Hermitian combinations: 

Li~Li m and i (LjJ; + L±J . (12.141) 

Consider the first combination. It follows from (12.136) that the parameters for the 
transformation generated by ( L i — Li m ) are 

^ = 2 sin mr cos mu , 

r = % a m - %°_ m = 2cosmr sin mo . (12.142) 

Quick calculation 12.6 Show that the parameters for the transformation generated by 
i (Li + L± m ) are 

= 2cosmr cos ma, 

= — 2sinwr sin ma. (12.143) 

Our discussion of the Virasoro operators has been quite detailed. We have examined their 
precise definition, and we have seen how they affect the computation of the mass. We 
have determined their commutator algebra, and we have exhibited how they act on string 
coordinates. In the following section we will see that the Virasoro operators also enter into 
the definition of the operators that generate Lorentz transformations. 



12.5 Lorentz generators 



In Chapter 8, the Lorentz invariance of the string action allowed us to find a set of con- 
served world-sheet currents A4" ,, labeled by the indices p and v, with /i /v. The resulting 
conserved charges M^, were given in (8.65), and for open strings with a e [0, tt] they read 

°)da = f (X,jJ>l - X v VD da. 

J 0 




(12.144) 
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Making use of (12.1) and raising the spacetime indices, 

M^ v = ^-“7 J (x^X v - X v X^do. (12.145) 

Constructing suitable quantum operators can be delicate, so let us gain some intuition by 
thinking classically. Explicit mode expansions for X 1 ' and X v are given in equations (9.56) 
and (9.57). Since M /lv is guaranteed to be r -independent, to evaluate (12.145) it suffices 
to pick up the r -independent terms that arise in the products. For example, 

X^X V — Xq (v / 2o' / c*q) + i 2 a 1 ^ - a^a v _ n cos 2 no + ■ ■ -, (12.146) 

n^o n 



where the dots represent r-dependent terms that must fail to contribute in the calculation 
of M ILV . With this equation, and a similar one with /x and v exchanged, we find that upon 
integration (12.145) gives 



M^=x£p v -xZp» 



oo . 

“ i Jl-( a -n a n -“-«<)• 



(12.147) 



n= 1 



Quick calculation 12.7 Prove equation (12.147). 

Equation (12.147) gives the classical Lorentz generators in terms of oscillation modes. We 
should ask ourselves whether we can use it, with the a recognized as operators, to define 
the quantum Lorentz generators. We will use (12.147) to suggest the form of the quantum 
Lorentz generators in light-cone gauge string theory. Since the canonical structure of the 
theory in the light-cone gauge is unusual, there is no guarantee that we can build consistent 
quantum Lorentz generators. An inability to construct quantum Lorentz generators would 
mean that quantum string theory fails to be physically Lorentz invariant. 



In light-cone gauge, the most delicate quantum Lorentz generator is M 1 because the X~ 
coordinate is a rather nontrivial function of the transverse coordinates. A consistent M~ l 
must generate Lorentz transformations on the string coordinates, possibly accompanied 
by world-sheet reparameterizations. Indeed, in the simpler context of the point particle, 
the action of M~ l includes world-line reparameterizations. The generator M~ l must also 
satisfy the commutation relation 



[ M 



M 



-J 



= 0. 



To find a candidate for M 1 , we consider equation (12.147) and write 




71=1 



(12.148) 



(12.149) 



This is just a first guess, though it is a pretty good one. A satisfactory M~ l should be both 
Hermitian and normal-ordered. Let us consider Hermiticity first. The first term in the right- 
hand side of (12.149) is Hermitian since and p 1 commute. The second term, however. 
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is not, since xl and p do not commute. A simple solution is to symmetrize the term by 
writing 

t 1 °° 1 

M~‘ = x~ p 1 - - (xq p~ + p~Xq ) - i J2 ~ (“-«“« - «-n“n ) ■ (12.150) 

n = 1 

The last term above is fully Hermitian since (o^)' = aL n and (a“)l = aZ n . Consider now 
normal ordering. Do all of the annihilation operators appear to the right of the creation 
operators? They do, because the a~ oscillators are normal-ordered Virasoro operators. 
Finally, to be complete, we must give the definition of p~ . As stated in (12.106), p~ 
includes an undetermined constant a that reflects our difficulties in ordering the Vira- 
soro operator L^. With this definition, and writing the other minus oscillators in terms 
of Virasoro operators, we find 

M~‘ = x~ p 1 - (x' (/.,) + a) + (c,j + a) x' {) ) 

CO , 

- -p=L— £ - U-n< - • (12.151) 

V2a' p+ , n \ ) 

Now that we have a candidate for the quantum Lorentz charge M -/ , we can discuss the 
computation of [M ~ l , M~ J ], 



There is much at stake in this calculation. It is in fact, one of the most important calculations 
in string theory. Our Lorentz charge has two undetermined parameters: the dimension D 
of spacetime, implicit in the sums over transverse directions, and the constant a affecting 
the mass of the particles. The calculation is long and uses many of our previously derived 
results, including the Virasoro commutation relations. We will not attempt to do it here, 
but the result is 



[m _/ , M~ j 



1 UXJ 

L_ V (V a J -a J a /N ) 

r m= 1 

x („ 1 [ | -i (0 - 2 ) ] + i[i (0 - 2)+fl ]|. 



(12.152) 



The right-hand side is a sum of terms, each of which contains the operator 
( a!_ m a/ n — for a different value of m. Such terms cannot cancel each other, so 

the commutator above vanishes if and only if the coefficient in large braces vanishes for all 
positive integers m : 



i[l - — (D -2)1 + -T- 
L 24 J m 12* 



-\ T Y D ~ 2) 



= 0, V m eZ + . 



(12.153) 



It suffices to examine this condition for m = 1 and m = 2 to conclude that each of the 
terms in brackets must simply vanish. We therefore have 

l-^(D-2) = 0 and 1 (D - 2) + a = 0 . 

The first equation fixes the dimension of spacetime: 



(12.154) 
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D = 26. 


(12.155) 


The second equation then fixes the constant a : 


1 24 




a = (D — 2) = = -1. 

24 24 


(12.156) 



This value of a coincides with the one obtained in (12.111) by ordering and using 
the zeta function to interpret the resulting infinity. For future reference, with a = — 1 the 
expression for p~ in (12.106) becomes 



2 a' p 




I'] addition, because of (12.14) the string Hamiltonian is now just 



(12.157) 



H = Lfr- 1 . 



(12.158) 



Here, of course, L ( ' t is the normal-ordered operator without additional constants. The above 
equation is the precise version of equation (12.16). 

In summary, we have seen that the condition of Lorentz invariance of quantum string the- 
ory simultaneously fixes the dimension of spacetime and the constant shift in the masses of 
the particles. In superstring theory a similar calculation fixes the dimensionality of space- 
time to the value D = 10. The fact that string theory cannot be a good Lorentz invariant 
quantum theory in any arbitrary dimension shows that string theory is very constrained. 
Even more, since the dimension of spacetime is uniquely selected by the requirement of 
consistency, we can say that string theory predicts the dimension of spacetime ! 



12.6 Constructing the state space 



The classical open string does not provide a reasonable theory of physics because the mass 
of string states assumes a continuous range of values. Only the ground state is massless 
in the classical theory, and this ground state does not include any polarization labels. As a 
result, classical open strings have no states that can be identified as photons. The miracle of 
quantum string theory is that both of these problems are solved. The continuous spectrum 
disappears after quantization. Candidate photon states emerge because the downward shift 
of the squared masses gives us massless states with polarization labels. 
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Let us begin by introducing the ground states of the quantum string. The quantum string 
shares with the quantum point particle the same set of zero modes. We have the canonical 
pairs (xg, p 1 ) and (Xq , p + ). Therefore, just as for the point particle (see (11.43)), we 
introduce states 



I P + , Pt >• 



(12.159) 



The above states are called ground states for all values of the momenta indicated by the 
labels. They are also declared to be vacuum states for all the oscillators in string theory. 
Thus, by definition, they are annihilated by all the aj t : 

al\p + , p T ) = 0, n> 1, 1 = 2,..., 25. (12.160) 



How do we create states from the \p + , pr)7 We simply act on them with the creation oper- 
ators. There are infinitely many of them, and we can operate on each state arbitrarily many 
times with any particular creation operator. The list of creation operators at our disposal is 
infinite, but it can be organized as follows: 



a 

a 

a 



(2)t 


(3)1 


(25)1 


1 




' ’ flj 


(2) t 


(3)1 


(25)1 


2 


#2 


a 2 


(2) t 

n 


(3)1 
a „ 


(25)1 

■ • an 



(12.161) 



Above, the polarization index I has been enclosed by parentheses. The general basis state 
| A) of the state space can be written as 



oo 25 



w=nn 



\P + , Pt) 



n=l 1=2 



(12.162) 



Here the non-negative integer A , n j denotes the number of times that the creation operator 
a „ appears. As you can see, the state a ) is specified by stating how many times each of 
the oscillators in the list ( 12 . 161 ) acts on the ground state. This information is given by the 
list of non-negative integers X n j for all n > 1 and all I = 2, . . ., 25 . Since all the creation 
operators commute among each other, the order in which they appear is irrelevant. We 
restrict ourselves to the case where states only have a finite number of creation operators 
acting on the ground states. This means that for each state |A) only a finite number of A,,./ 
are different from zero. The string Hilbert space is an infinite-dimensional vector space: 
it is spanned by an infinite set of linearly independent basis states |A). This is why string 
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theory describes an infinite number of different particles! A general state in the Hilbert 
space is a linear superposition of the basis states |A). 

To understand the physical significance of the above states, consider the mass-squared 
operator (12.108), with our new found knowledge that a = — 1: 

1 OO 

M 2 = -(-l + ^^). (12.163) 

n = 1 

The sum appearing in (12.163) is important enough to have its own name; it is called the 
number operator N x : 

oo i 

N ± = J2 na n ta h' = + (12.164) 

n = 1 

N 1 - is the sum of standard number operators, one for each harmonic oscillator in the string. 
The main property of N x is that its commutator with a creation operator gives the mode 
number of that operator: 

[N x ,a I J]=naj l \ (12.165) 

as you can readily verify. In addition, 

^N x ,a I r ^ = -na , n . (12.166) 

Since the number operator is normal-ordered it annihilates the ground states: 

N x \p + , p T ) = 0- (12.167) 

Note, incidentally, that the number operator N" enters into the definition of L f) in (12.105). 
We can write 

= a>V + N x . (12.168) 

Let us get some practice using N x by computing its action on some basis states. 
Consider, for example, its action on a\ \p + , pj): 

N±a 2 \p + , pt) = \n x ,o I 2 ^\p + , pt) + a\ 1 N x \p + , p T ) = 2a!, p + , p T ) ■ 

The state is an eigenstate of N 1 - with eigenvalue 2. Now let us try a more complicated 
state: 

N x a^ aj \p + , pj) = [n -1 , \p + , pj) + a^' N^a 1 ^ \p + , p T ) 

= 5afa^\p + ,p T ) . (12.169) 

It is clear that when the number operator acts on a basis state, the eigenvalue is the sum of 
the mode numbers of the creation operators appearing in the state. In general, for the basis 
state | A,) in (12.162) we have 
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oo 25 

N- L \k) = N^W, with N^=^2^2nX nJ . (12.170) 

n=\ 1=2 

Since N L enters additively into the mass-squared operator (12.164), we see that the oscil- 
lator with mode number n contributes n units of 1 /a' to M 2 . The eigenvalues of N 1 are 
non-negative integers, so for all string states M 2 > — 1/a'. 

The open string state space is naturally equipped with an inner product. To define this 
inner product we introduce bras (p + , pj I which are the Hermitian conjugates of the kets 
| p + , pj), and declare that 

(p ,+ > Pt\ P + > Pt ) = s (p' + - P + )S(.Pt - pt) • (12.171) 

The delta functions here are necessary because the Hermiticity of the operators p + and pj 
guarantees that the overlap above vanishes unless the p + and pj eigenvalues of the states 
are identical. For the basis state |A) in (12.162), we introduce the Hermitian conjugate bra 
(A. | defined by 

oo 25 ^ 

w = ( p + , pt i n n ( a 0 ,hl - ( i2 - i72 > 

n= 1 1=2 

The Hermitian conjugate of \X)b, where b is a number, is simply b*{X\. The inner product 
(X ' , X) between two basis states |A') and |A) is defined as 

(X' , X) = (X'\ X) . (12.173) 

For arbitrary states, the inner product is defined by declaring it to be linear on the second 
argument and antilinear on the first argument. To evaluate the overlap (A/|A) one moves the 
annihilation operators in (A/| towards the ground state in a) and the creation operators in 
| A) towards the ground state in (A'|. In moving these operators across each other we use the 
commutation relations (12.64), which appear in the standard form because in the overlap 
the annihilation operators are to the left of the creation operators. The commutators give 
+ 1 whenever they are nonvanishing, so the result of simplifying the oscillator content of 
the overlap is a positive number times the basic overlap in (12.171). For example, take 
| A') = | p' + , pj) and | A) = a ^ \p + , pj). We then find 

<A'|A>= (p ,+ ,p^a I 1 a( t \p + ,p T )=S IJ S(p ,+ - P + )S(Pt-Pt), (12.174) 

where we evaluated the overlap by replacing the product of oscillators by their commutator 
and using (12.171). Since the delta functions are positive functions, we conclude that all 
basis states | A) have positive norm: (A, A) > 0. This inner product does not vanish because 
the annihilation operators on the bra match with the creation operators on the ket. Tech- 
nically, (A, A) is infinite because of the delta functions, but this infinity becomes harmless 
when we consider continuous superpositions of states. Using the linearity and antilinearity 
of the inner product, we conclude that any state has positive norm. This is consistent with 
our statement that the open string state space is a Hilbert space. 
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Quick calculation 12.8 Explain why (A/, X) — 0 whenever the basis states |A/) and |A.) are 
different. 

For each state |A) in (12.162), we can construct the corresponding time-dependent physical 
state 



which satisfies the Schrodinger equation with Hamiltonian (12.158). In the light-cone 
quantization of the string a time-dependent state is physical if it satisfies the Schrodinger 
equation. We will consider more general superpositions of time-dependent states in the 
following section. 

We are now ready to discuss particular states in some detail. We will begin with the simplest 
ones, the ground states. These are the unique states with N L — 0. As in the case of the point 
particle, the states \p + , pj) are the one-particle states of a scalar field. They are states of a 
scalar particle. What is the mass of this particle? To find out, we act on the states with M 2 : 



The value of M 2 is all due to the ordering constant. If this constant had vanished the 
mass would have been zero. In fact, massless scalars are problematic - they have not been 
observed in nature. The result, however, is strange: M 2 = —l /a' < 0. The wavefunction 
of the state tells the same story: \[r (r, p + , pj) can be set in correspondence with a classi- 
cal scalar field. This scalar field, which has a negative mass-squared, is called a tachyon. 
A negative mass-squared is a sign of instability: the potential for a scalar field goes like 
V = \M 2 <p 2 (see (10.2)), so a negative M 2 simply means that the stationary point 0 = 0 
is unstable. The energy can be reduced by having 0^0. We will study this further in 
Section 12.8. 

Let us consider now the excited states with lowest M 2 . Those arise when N 1 - takes 
the smallest possible nonzero value, N 2 - = 1. Remarkably, due to the ordering constant, 
the N 2 - = 1 states have M 2 = 0. They are massless states. Had the ordering constant taken 
a noninteger value, quantum string theory would have no massless states. We get states 
with N 2 - — 1 when we act with any of the transverse oscillators a[ on the ground states 
| p + , pj). That means that we have D — 2 = 24 massless states: 




( 12 . 175 ) 




a[\p + , pt) , M 2 a{'\p + , p T ) =0. 



(12.177) 



The general massless state is a linear combination of the above basis states: 



25 



^Hia[\p + , pt) ■ 



1=2 



(12.178) 



267 



12.6 Constructing the state space 



The above expression may remind you of the photon states (10.88) that we found in our 
light-cone analysis of Maxwell theory: 

D - 1 

J2^ a p\p^ Q) - (12 - 179) 

1=2 

We have a matching of states: in both cases is an arbitrary transverse vector, and the 
states correspond to one another: 

a[\ P + ,PT) <— ► %\ pT \^). (12.180) 

Both states have exactly the same Lorentz labels, they carry the same momenta, and have 
the same mass. This proves a remarkable result. The open string theory quantum states 
include photon states! Our study of open string theory started from the Nambu-Goto 
action. This action has no hint whatsoever of electromagnetic gauge invariance. Never- 
theless, we have shown that open string theory contains Maxwell field excitations. This 
astonishing result is largely due to the mass shift encountered in passing from the classical 
to the quantum theory of the open string. 

It is worth belaboring the point. In Chapter 10 we showed that the quantum states of free 
Maxwell theory - the photon states - are (D — 2) massless states, labeled by a transverse 
Lorentz index. The index is important: it indicates that these states transform into each 
other under Lorentz transformations. Exactly these kind of states have appeared in our 
quantization of the string. Additionally, the collection of wavefunctions i /r/(r, p + , pr) 
associated with the states in (12.177) matches with the components A/ of the Maxwell 
gauge field. Finally, the Schrodinger equation for these wavefunctions matches the light- 
cone gauge field equation for the Maxwell field. We will show this in the following section. 

Let us conclude with an examination of the states with N L — 2. They are built by acting 
on the ground states with a[' aj' or with . The number of states built with a[ a { is the 
same as the number of independent entries in a symmetric matrix of size (D — 2), namely 
lj(D — 2)(D — 1). The number of states built with a!^ is (D — 2). The total number of 
states is therefore 

\{D - 2 )(D - 1) + (D - 2) = 1 -{D - 2 )(D + 1) , (12.181) 

and their mass-squared is given by M 2 = 1 /ot\ These particles are known as massive ten- 
sors, and in D — 26 there are 324 such states. Our results for all states with N 1 - < 2 are 
summarized in Table 12.1. A formula useful to count states at higher values of N 1 - is 
derived in Problem 12.11. The formula tells you that at N 1 - = 3, for example, you get 
2600 states by acting with a [ ' af ' af ' on the ground states. 

Each state |k) of the quantum string represents a one-particle state of fixed momentum. 
Thus, the a\\p + , pj) are one-photon states, and the a 1 ^ \p + , pr) are one-particle 

tensor states ( not two-photon states). Each state | A.) has discrete labels k n j and continuous 
labels p + and pr- There is one wavefunction for each set of discrete labels, as you can 
see in the table. Accordingly, there is one quantum field for each set of discrete labels. 
The multiparticle states are described using these quantum fields. The total quantum field 
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w 

Table 12.1 


List of open string states with N 1 - < 2 




N+ 


W 


a'M 2 


Number of states 


Wavefunction 




0 


lp + . PT ) 


-1 


1 


Vf(r, p + , Pt ) 




1 


af\P + , PT ) 


0 


D — 2 


fil(x,p + ,p T ) 




2 


a[ t af\p+,p T ) 


1 


\(D — 2)(D + 1) 


filjit, P+, Pt) 






a^\P + , PT ) 






p+, p T ) 




An unoriented open string is a quantum state that can be obtained by superposition of states 
that differ only by orientation. The unoriented state is invariant under orientation reversal. 

theory which describes the whole set of quantum fields associated with the one-particle 
states of the string is called string field theory. 

The quantum theory we have discussed so far is that of oriented open strings. The quan- 
tum operator X 1 (r, er) involves a parameter a e [0, jr] and thus an orientation, defined 
as the direction of increasing a . It is possible to define a theory of unoriented strings 
(Problem 12.12). The key idea is that one can define an operator £2 that is a symmetry of 
the theory (commutes with the Hamiltonian) and reverses the orientation of strings. The 
theory of unoriented strings is obtained by restricting the oriented string spectrum to the 
set of states that are invariant under the action of £2. Unoriented strings are not strings 
without orientation: they should be viewed as quantum superposition of states that, as 
a whole, are invariant under orientation reversal. We can imagine an unoriented state as 
the superposition of a string state and the same state with opposite orientation, as shown 
in Figure 12.1. 



12.7 Equations of motion 



To elaborate on the correspondence between string states and quantum fields we now 
consider the Schrodinger equations satisfied by the string wavefunctions. We saw in 
Section 11.4 that the Schrodinger equation for the point particle wavefunction is isomor- 
phic to the classical field equation of a scalar field. We want to repeat such an analysis for 
the string. 
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To construct general time-dependent states from the string basis states we need 
wavefunctions. Consider, for example, a basis state 

...a^\p + ,p T ). (12.182) 

The general time-dependent state built by superposition is 

I 'I', r > = J dp + dpTfi i ...i k {r,p + ,pT)a I n \ ] ...a , n k k \p + ,p T ). (12.183) 

The polarization indices carried by the oscillators match with the index labels of the wave- 
function i///| ...r k (r, p + , pr )• This equation is the string analog of (11.48), which gives 
the general time-dependent state of the point particle. For general tachyon states (12.183) 
becomes 

| tachyon, r) = J dp + dpj i/r (r, p + , pj) \p + , Pt) ■ (12.184) 

For photon states we write 

| photon, r) = J dp + dpj Vo(u p + , pr) a[^ \p + , pr) ■ (12.185) 

The Schrodinger equation satisfied by the general states (12.183) is 

3 

i — l^, r) = H\V, r). (12.186) 

3r 

Here, the Hamiltonian is given by 

H = (Lq — 1) = a'p'p 1 + N 1 - - 1 = a'ip 1 p 1 + M 2 ) , (12.187) 

where we used (12.158) and (12.168). Using the explicit expression (12.183) for the states, 
equation (12.186) gives: 

= (a'p’p' + N 1 - - l) ir h ... h , (12.188) 

where N 1 - denotes the eigenvalue of the operator N 1 for the state (12.183). 



Quick calculation 12.9 Show that equation (12.188) emerges from the Schrodinger equa- 
tion (12.186). The calculation parallels that which gave (1 1.53). 



For the tachyon states (12.184), N 1 - — 0, and we get 

ij^T = (a'p’p 1 - 1 )f- 

For the photon states (12.185), N 1 - = 1, and we get 



• 9l A/ , j j . 

i —— = a p J p J if! . 
dr 



(12.189) 



(12.190) 



Let us now compare these Schrodinger equations with the relevant classical held equations. 
We showed in Chapter 10 that the scalar held equation 



^3 2 - <p 



= 0 , 



(12.191) 
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could be written as (10.30): 




Letting x + = 2a' p + r we now have 




This equation is precisely the same as (12.189) when m 2 = —1/a', confirming the identi- 
fication of the tachyon with a scalar field. Perhaps more surprisingly, equation (12.193) is 
structurally equivalent to the Schrodinger equation (12.188) satisfied by any string wave- 
function. The only difference is that the wavefunctions carry indices. As a result, if the 
correspondence is to hold, the classical field equation for the field associated with any 
string state must take the form (12.191), with the field carrying some indices. 

This may seem strange: is not the Maxwell classical field equation, for example, more 
complicated than the field equation for a scalar? Not in the light-cone gauge. We noticed 
this before: equation (10.83) showed that the transverse components of the gauge field 
satisfy p 2 A ! (p) = 0. This is of the form (12.191) with m 2 = 0. The steps leading from 
(12.191) to (12.193), when applied to d 1 A I = 0 give 



This classical field equation for the Maxwell field is in complete correspondence with the 
Schrodinger equation (12.190) for the N 1 - = 1 wavefunctions. 



We conclude this chapter by discussing the physics of the tachyon. We explained earlier 
that the tachyon state has the lowest value of M 2 : 



The field associated with this state is a scalar field. What does it mean for this scalar to 
have a negative M 2 ? The physics of the open string tachyon was a mystery ever since 
the discovery of string theory. A series of developments starting in 1999 have essentially 
elucidated the role of the open string tachyon. Let us discuss what has been learned. 

Our first goal is to understand the instability of a theory with a tachyon. For this purpose 
we consider the Lagrangian density for a classical scalar field, along the lines of Section 
10.2. In some generality, 




(12.194) 



12.8 Tachyons and D-brane decay 




(12.195) 




(12.196) 
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(a) A potential 1 7(0) = ±A1 2 0 2 with positive mass-squared At 2 . The value 0 = 0 is a stable 
critical point, (b) A potential 17(0) = ±M 2 0 2 with negative mass-squared At 2 . The value 
0 = 0 is an unstable critical point. 



where V (0) is the potential for the scalar field. For spatially homogeneous field config- 
urations, V0 = 0, and the potential energy density is given by the potential V (0). The 
equation of motion following from variation is 

9 2 0 -V'(0) = O, (12.197) 

where prime denotes the derivative with respect to the argument. More explicitly, 

~TT + V 2 0 - V'(0) = 0. (12.198) 



Quick calculation 12.10 Prove that equation (12.197) arises from variation of the action 
S = f d D xC. 



To understand the instability of the tachyon scalar field theory it suffices to consider the 
free part of the tachyon Lagrangian; interactions will feature later. For a free scalar field 
theory, the potential V (0) takes the form 

1/(0) = 1m 2 0 2 . (12.199) 

Here M 2 is the mass-squared of the scalar field (this potential will change with the inclu- 
sion of interactions). When M 2 > 0, the potential V (0) has a stable minimum at 0 = 0. 
When M 2 < 0, V (0) has an unstable maximum at 0 = 0 (Figure 12.2). We can understand 
the implications of such potentials by studying the equation of motion for the field. Using 
the specified form of V, equation (12.198) gives 

9 2 0 7 t 

- — + V 2 0 - M 2 0 = 0. (12.200) 

ot A 

To make our analysis simpler, let us assume that the field 0 depends only on time. The 
equation of motion then becomes 



d 2 <p(t ) 
dt 2 



+ M 2 0(f) = 0. 



( 12 . 201 ) 
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When M 2 = M ■ M > 0, the solutions of this equation represent oscillations: 

0 — A cos (Mr) + B sin (Mr) = C sin (Mr + cm) ■ (12.202) 

This is the interpretation of a scalar field with a “good” mass-squared. The scalar field 
could sit at 0 = 0 forever because it is a stable point; if it is displaced, it simply oscillates 
around 0 = 0. 



Consider, on the other hand, the tachyon, which is an example of a scalar with negative 
mass-squared. In this case it is convenient to write M 2 = — /6 2 = ■ /3, and equation 

(12.201) becomes 

d 2 (b(t) 7 

-^-/i 2 0(r) = 0, (12.203) 

with p 2 > 0. This time the solutions are 

cp(t) = A cosh (jSr) + B sinh (fit) . (12.204) 



Consider the solution (pit) = sinh (/Sr). At time zero (p is zero, but as time goes to infinity 
(p also goes to infinity. We can imagine this as the field rolling to the right of the potential in 
Figure 12.2(b). In fact, for any nontrivial solution, (p necessarily reaches infinite absolute 
value either in the far past or in the far future. The tachyon could stay at (p — 0 forever, 
using the trivial solution (pit) — 0, but any infinitesimal perturbation would set it on a 
course to a dramatic roll-off. The value (p = 0 is an allowed critical point, but it is unstable. 
We cannot realistically expect the tachyon to stay near 0 = 0 for an indefinite length of 
time. This is the instability of a theory that contains a tachyon. Since the mass-squared of 
the open string tachyon is equal to (— 1 /ex'), the free part of the tachyon potential is 

V,achW = -^ 2 - (12-205) 



A mechanical analogy works for arbitrary potentials V(ep). You can visualize the spa- 
tially homogeneous rolling of a scalar field on a potential V i<p) by considering the motion 
of a particle on the potential V (x), where x, replacing 0, is the coordinate along the motion. 
Indeed, the relevant equations match. For an arbitrary potential V (0), homogeneous rolling 
is governed by 



d 2 ep 
dt 2 



-v\4>). 



(12.206) 



while the rolling of a unit-mass particle on a potential V(x) is governed by Newton’s 
second law: 



d 2 x 
dt 2 



— V'(x) . 



(12.207) 



The presence of a tachyon signals an instability of open string theory. More precisely, 
there is some instability in the theory of open strings on the background of a space-filling 
D25-brane. It is clear that we should try to understand the fate of this instability: once the 
tachyon begins to roll, where does it end? For a while, not everyone agreed that this was an 
urgent question. Some argued that along with the lack of fermions, the tachyon was another 
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good reason to consider this open string theory unrealistic and not worth much study. Some 
even saw the tachyon as a sign that open bosonic string theory is simply inconsistent. For 
quite a few years, superstring theories, the kind of string theories that also include fermions, 
seemed blessedly devoid of tachyons. Later studies, however, showed that tachyons can 
appear when we construct realistic models based on superstrings. It then became clear that 
we must try to understand tachyons. 

The open string theory we have in our hands is the theory of strings on a D25-brane, a 
D-brane that fills all of the space dimensions. The D25-brane is a physical object, not just 
a mathematical construct, so it has a constant energy density 725 which, in fact, can be 
calculated exactly. The key insight can now be stated: the theory of open strings is, in some 
sense, the theory of the D25-brane itself! We have viewed tachyons as states of strings 
attached to a D-brane. A D-brane with open strings attached, it turns out, is an excited 
state of the D-brane. If this is so, a tachyon state represents an excitation that can lower 
the energy of the D-brane. The existence of the tachyon is telling us that the D25-brane is 
unstable ! 

Since the tachyon describes the physics of the D25-brane, the energy density of this 
brane is a contribution to the potential energy of the system, and it must be incorporated 
into the tachyon potential. As a result, the potential in (12.205) is changed into 

Ttach (0) = r 25 - - 1 - <p 2 + w 3 + • • •. (12.208) 

lol' 

We have also included in the tachyon potential a cubic term and represented other possible 
terms by dots. All the terms that are cubic or higher order in the field represent the effect of 
interactions. The above potential describes correctly our statements about the D25-brane. 
The unstable point cf> = 0 represents the world with a D25 -brane, and therefore has an 
energy density T 2 5 . To find out what happens when the tachyon starts rolling down, we 
need to calculate the full tachyon potential Vtach (</>)■ 

The physics can be anticipated before computing this potential. If the D25-brane is 
unstable, it will decay. The stable endpoint of this process would be a world without 
the D25-brane. If this is so, the tachyon potential must have a stable critical point at 
some 4> — <p* with Vtach (</>*) = 0. That stable critical point would represent a background 
with zero energy, a background where the D25-brane, rendered unstable by the tachyon, 
has disappeared completely! The expected form of the tachyon potential is shown in 
Figure 12.3. 

This proposal has now been verified convincingly. The complete tachyon potential was 
calculated using the field theory of open strings and, remarkably, it was possible to display 
a critical point with zero energy. With this result, and additional evidence obtained by 
other means, physicists have demonstrated that the tachyon instability is the instability of 
the D25-brane. 

What happens when the tachyon rolls down to the stable minimum and the D25-brane 
disappears? All the open strings must also disappear, because open string endpoints are 
confined to D-branes. Closed strings, however, can exist in the absence of D-branes. All 
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The tachyon potential for the open string theory based on a D25-brane. The configuration 
(j> = 0 represents the unstable D-brane. The stable critical point <j>* has zero energy. 



the energy initially stored in the D25-brane goes into closed strings. At the stable critical 
point <p* all particles that arise as open string excitations, including the tachyon, must dis- 
appear. This shows that the theory near <p* is quite subtle. Vacuum string field theory is an 
attempt to formulate string theory at the vacuum </;* where both D-branes and open strings 
disappear. 

Further interesting facts about tachyons and D-branes have emerged. It has been shown 
that D/z-branes with p < 25 are themselves large and coherent excitations of the tachyon 
field of the D25-brane. In some sense, D-branes are made of tachyons! This is also true, 
with minor modifications, in superstring theory. In superstring theory certain D-branes 
carry charge and therefore charge conservation ensures that they are stable against decay. 
In fact, the open string theory in the background of any such D-brane has no tachyons. 
However, a configuration consisting of a D-brane and a coincident, oppositely charged anti- 
D-brane, is unstable: the two objects can annihilate without violating charge conservation. 
The open strings which stretch from the D-brane to the anti-D-brane contain a tachyon - 
a superstring tachyon! This tachyon describes the instability of the D-brane/anti-D-brane 
pair. The study of D-brane/anti-D-brane annihilation plays an important role in attempts to 
use string theory to describe the early universe. It is thus possible that the tachyon will end 
up playing a prominent role in string cosmology. 



Problems 



Problem 12.1 Heisenberg equation for the momentum density. 

We verified in (12.21) that X 1 = 2n ot'V Tl follows from the Heisenberg equation if = 
[£, H], when § = X 1 . Calculate 'P rl . and use the result to verify that the classical equation 
of motion X 1 — X 1 =0 holds as an operator equation. 
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Problem 12.2 Testing explicitly some vanishing commutators. 

Use the mode expansion (12.32) and the commutation relations of the a to check explicitly 
that equation (12.26) holds: [X 1 ' (z, a), X J ' (r, a')~\ = [X 1 (r, a) , X J (r, a')] = 0. 

Problem 12.3 Testing explicitly the main commutator. 

(a) Use the explicit mode expansions of X 1 and V rJ , together with the commutation 
relations (12.45) and (12.54), to show that 

X 1 (r, a) , V xJ (r, a ') j = i r] IJ — cos na cos no' . 



(b) If the above result agrees with (12.10), we must have 

. 1 x 

8(<r -<*') = ~ Y 

IT L ' 



cos no cos no . 



(i) 



This equation follows from the completeness of the functions cos na with n > 0 on the 
interval a e [0, 7r]. The completeness is readily explained: any function f{a) defined 
over a e [0, : r] can be extended to a function over a e [— n, tt] by letting /(— a) = 
f{a) for a e [0, i r]. The resulting function is an even function of cr and by the basic 
result of Fourier series it can be expanded in terms of cosines. We can therefore expand 
any function f (a) with a e [0, n] as 

OO 

/(c) = E A n cos na. (2) 

n = o 

Prove (1) by calculating A n and substituting the result back into the right-hand side 
of (2). 



Problem 12.4 Analytic continuation of the zeta function. 

Consider the definition of the gamma function 1 ' (.v ) = J 0 °° dt e~'t s ~ l . Let t — »• nt in this 
integral, and use the resulting equation to prove that 

n OO — 1 

r(s)t(s)= dt- — SR(5)>1. 

Jo e' -l 



Verify also the small t expansion 

1 lit , 

= + — + 0(r). 

e l — 1 t 2 12 V ’ 

Use the above equations to show that for IH(s') > 1 

r(U c(.v) = 



[ 1 . j_! / i ii t \ 1 1 

J 0 t 2 12/ s-\ 2i 

i; 



i 



e< - 1 



(1) 



( 2 ) 



2s 12 (s + 1) 
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Explain why the right-hand side above is well defined for SH(.v) > —2. It follows that this 
right-hand side defines the analytic continuation of the left-hand side to 9f(s) > —2. Recall 
the pole structure of PCs') (Problem 3.6) and use it to show that £( 0) = —1/2 and that 
?(-l) = -l/12. 

Problem 12.5 The Virasoro algebra is a Lie algebra. 

A vector space L with elements x, y, z, ■ ■ ■ and a bilinear bracket [• , •], that takes two ele- 
ments of L and yields another element of L, is a Lie algebra if the two following conditions 
hold. 

(i) Antisymmetry: [x, y] = — [y, x ] for all elements x and y of L. 

(ii) Jacobi identity: [x, [y, z]] + [y, [z, x]] + [z, [x, y]] = 0, for all elements x, y, and z 
of L. 

Consider the vector space L spanned by the Virasoro operators with modes n e Z. Show 
that the commutators in (12.124), assumed to hold for all values of m and n, define a Lie 
algebra. Then, consider the commutators in (12.133), and show that they also define a 
Lie algebra. 

Problem 12.6 Consistency conditions on the Virasoro algebra central terms. 

The Virasoro commutation relations take the form 

L n 1 = ( m ~ n ) L m+n + C ( m )< Wn.O - (1) 

where C(m ) is a function of m that was calculated directly in this chapter. The purpose 
of this problem is to find the constraints on Cijn) that follow from the condition that (1) 
defines a Lie algebra. 

(a) What does the antisymmetry requirement on a Lie algebra tell you about C(m)l What 
is C(0)? 

(b) Consider now the Jacobi identity for generators L^, L- jy and Ljr with m + n + k — 0. 
Show that 

( m — n)C(k) + (n — k)C(m) + (k — m)C(n) = 0. (2) 

(c) Use equation (2) to show that C(m) = am and C(m) = fhn 3 , for constants a and /3, 
yield consistent central extensions. 

(d) Consider equation (2) with k = 1. Show that C(l) and C(2) determine all C(n). 
Problem 12.7 Exercises with Virasoro operators. 

(a) Use the Virasoro algebra (12.133) to show that if a state is annihilated by L\ and Lj it 
is annihilated by all L n with n > 1 . 

(b) Consider the Virasoro operators Lq, Li, and L_ i . Write out the three relevant commu- 
tators. Do these operators form a subalgebra of the Virasoro algebra? Is there a central 
term in here? Calculate the result of acting with each of these three operators on the 
zero-momentum vacuum state |0). 
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Problem 12.8 Virasoro operators acting on states. 

(a) Use (12.100) to write the state L^ 6 |0} as a finite sum of terms with oscillators acting 
on the vacuum. Do the same for L^ 2 L^ 2 |0). 

(b) Write L^L^ 2 ]0) and |0) as finite sums of terms with oscillators acting on 

the vacuum. Use the results to evaluate [L^ , /. j 2 ]|0). Confirm that your evaluation 
is consistent with the Virasoro algebra (12.133). 

Problem 12.9 Reparameterizations generated by Virasoro operators. 

(a) Consider the string at r = 0. Which of the combinations in (12.141) reparameterizes 
the a coordinate of the string while keeping r = 0? When r = 0 is preserved, the 
world-sheet reparameterization is actually a string reparameterization. Show that the 
generators of these reparameterizations form a subalgebra of the Virasoro algebra. 

(b) Describe the general world-sheet reparameterization that leaves the midpoint a — 7r/2 
of the r = 0 open string fixed. Express this reparameterization using an infinite set of 
constrained parameters. 

Problem 12.10 Reparameterizations and constraints. 

(a) Verify that the reparameterization parameters in (12. 137) satisfy the relations (omitting 
the subscript m for convenience) 

g-Z ^Ot j-<7 £-Zf 

(b) Think of the reparameterizations (12.138) generated by the Virasoro operators as a 
change of coordinates 

x' = x + e £ T (r, er) , a' = a + e £ ct (t, er) . 

Note that for infinitesimal e the above equations also imply that 

r = x' — e § T ( x' , o') , a = a' — e (x' , cr') . 

Show that the classical constraints 

a T v-9 ff v = o, (a T v) 2 + (9 tr z) 2 = o, 

assumed to hold in (r, cr) coordinates, also hold in (x', a') coordinates (to order e). 

Problem 12.11 Counting symmetric products.^ 

Prove that the number N of different products of the form 



where the superscripts I\, I 2 , . . ., I n run over k values 1, 2, ...,&, is given by 



N = 



(n + k — 



D! 



k k+ 1 

I ' 2 



k + 2 



k + n — 1 



n\ ( k - 1)! 



3 



n 
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Hint: one can represent each product using n identical balls and k 
For k = 6 and n — 9, for example, we have: 



1 identical dividers. 



•••••• 



I 444 5 66 

a a a~ a a a a a a . 



Problem 12.12 Unoriented open strings. 

The open string X^(r, cr), with a e [0, it] and fixed r, is a parameterized curve in 
spacetime. The orientation of a string is the direction of increasing a on this curve. 



(a) Consider now the open string X M (r, tt — o) at the same r. How is this second string 
related to the first string above? How are their endpoints and orientations related? Make 
a rough sketch showing the original string as a continuous curve in spacetime, and the 
second string as a dashed curve in spacetime. 

Introduce an orientation reversing twist operator £2 such that 

£2 X 7 (t, cr) £2 _1 = X 7 (r, 7 r — cr) . (1) 

Moreover, declare that 

C2 jc^“ £2 _1 = Xq , £2 p + £2 _1 = p + . (2) 

(b) Use the open string oscillator expansion (12.32) to calculate 

£ 2 xq £ 2 _1 , £ 2 ci (0 £ 2 _l , and £2 af, £ 2 _1 (n ^ 0 ) . 

(c) Show that £2 X~(r , cr) £2 _1 = X~(r, Tt — cr). Since £2 X + (x , a) £2 _1 = X + (r, it —ct), 
equation (1) actually holds for all string coordinates. We say that orientation reversal 
is a symmetry of open string theory because it leaves the open string Hamiltonian H 
invariant: £2//£2 _1 = H. Explain why this is true. 

(d) Assume that the ground states are twist invariant: 

£2 I p + , pt) = £2 _1 I F + . Pt) = I P + , Pt) ■ 

List the open string states for N 1 - < 3, and give their twist eigenvalues. Prove that, in 
general, 

£2 = (-1)^ . 

(e) A state is said to be unoriented if it is invariant under twist. If you are commissioned to 
build a theory of unoriented open strings, which of the states in part (d) would you have 
to discard? In general, which levels of the original string state space must be discarded? 



Problem 12.13 Tachyon potentials. 

Consider scalar held theories of the form 

S = J d°x(-^^-^)). 



( 1 ) 
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We will examine three different scalar potentials: 



v iw = 0 o ) 2 U + \<h), 

a' 30o ' 2 / 


(2 a) 




(2b) 




( 2 c) 



where </>o is a (positive) constant. For each of the three potentials V, , do the following. 

(a) Plot Vi (</>) as a function of 0. 

(b) Find the critical points of the potential and the values of the potentials at those points. 
Each critical point represents a possible background for the scalar field theory. 

(c) At each critical point 0 expand the action for fluctuations of 0 around this point, that 
is, let 0 = <f> + 0 where the fluctuation \j/ is small. The quadratic term in 1 // (with no 
derivatives) can be used to read the mass of the scalar particle. Give the mass of the 
scalar particle for each critical point. 

The potential V] is a rough model for the tachyon potential on an unstable D-brane. 
V 2 is an exact (effective) tachyon potential on an unstable D-brane. The potential V 3 is a 
rough model for the superstring tachyon potential on the world-volume of a D-brane and a 
coincident anti-D-brane. 




Except for shared position and momentum zero modes, the operator content of 
quantum closed strings can be viewed as two commuting copies of the open 
string operators. Even in the light-cone gauge, the reparameterization invari- 
ance cannot be fully fixed: there is no natural way to choose a starting point 
for a closed string. As a result, the closed string spectrum is subject to the 
constraint L q- — L jj- = 0, which selects the states that are invariant under rigid 
rotations of the string. We find that the massless closed string quantum states 
include one-particle graviton states, making string theory a quantum gravity the- 
ory. Additionally, we find massless Kalb-Ramond and dilaton states. The dilaton 
state controls the strength of string interactions. We study closed strings on the 
orbifold R V^2- 
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When it was first discovered, string theory was thought to be a theory of strongly inter- 
acting particles - a theory of hadrons. The consistency of open string theory required the 
inclusion of closed strings. But there was a problem with closed strings: among the exci- 
tations of closed strings there were massless states with spin two. No known hadron had 
these properties. Despite much effort, all attempts to eliminate these closed string states 
from the spectrum failed. 

It turns out that these massless states can be identified as graviton states, and physi- 
cists eventually realized that closed string theory could be a theory of quantum gravity. In 
this chapter we quantize the relativistic closed string and see how graviton states emerge. 
Much of the quantization procedure will resemble our quantization of the open string in 
Chapter 12, but there are a number of new features. Let us begin by recalling some of the 
important facts about closed strings that we learned in Chapter 9. We considered at that 
time a family of gauges (see (9.27)) defined by the conditions 



The second condition implies that the parameter cr spans an interval of length 2 it : 



n ■ X — a' (n ■ p) r , n ■ p = 2tc n ■ V T . 



(13.1) 



a e [0, 27 t] . 



(13.2) 



Here, a = 0 and a — 2n represent the same point on the closed string. The range a e 
[0, 2n] for closed strings is twice the range a e [0, tt] used for open strings. We found 
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that the conditions (13.1) do not fully fix the parameterization of the closed strings. Unlike 
open strings, closed strings do not have a special point that can be selected as a — 0. 
We used this arbitrariness to our advantage: once we selected some a = 0 on one closed 
string, we could impose the constraint X' ■ X = 0 by suitably choosing the a — 0 point on 
all of the other closed strings on the world-sheet. After this, we still had the ability to let 
a — » a + cto, with some constant cro that is the same for all strings. This rigid rotation of 
the lines of constant a is a reparameterization invariance of the closed string action that 
cannot be fixed. When we build the quantum states of the closed string, this will result in a 
constraint on states. 

The condition X' ■ X = 0, together with the parameterization conditions (13.1), implied 
X'~ + X 2 — 0. We thus obtained the familiar conditions 

(X±X') 2 = 0, (13.3) 



and the momentum densities became simple derivatives of the coordinates: 

= , V TIJ -=^—;X 11 . (13.4) 



2ita' 2 it a' 

Finally, all the string coordinates were seen to satisfy the wave equation: 






(13.5) 



Let us now consider the classical solution to the equation of motion for the closed string. 
The general solution to the wave equation is 



X^r.er) = x£(r + er) + X%(t - er) , (13.6) 

where x£ (the L stands for left-moving) is a wave moving towards more negative a and 
X 1 ^ (the R stands for right-moving) is a wave moving towards more positive er. For open 
strings, the left-moving and right-moving waves were related to each other by the boundary 
conditions at the endpoints. The closed string has no endpoints, but we have a periodicity 
condition to work with. The parameter space (r, er) for closed strings is a cylinder, so, to 
describe closed strings properly we compactify the world-sheet coordinate er : 



a ~ er + 2jt . 



(13.7) 



Two points on the world-sheet whose difference of a coordinates is a multiple of 2tc are 
the same point. We can in fact use any interval of the form [cro, <ro + 2tt] to describe the 
closed strings; the choice in (13.2) is one of the possible choices. When we include the r 
coordinate, the identification of points on the parameter space is given by 

(r, er) ~ (r, a + 2 tt) . (13.8) 

We demand that X 1 ' assumes the same value at any two coordinates that represent the same 
point on the parameter space: 
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X^(t, a) = X^ir, cr + 2n ) , for all rand a. (13.9) 



This condition is both easier to deal with and easier to interpret than the naive condi- 
tion X^(r, 0) = X ll (r, lit). The periodicity condition (13.9) is appropriate for strings that 
propagate in a simply connected space, a space in which every closed string can be contin- 
uously shrunk to a point. Minkowski space, for example, is simply connected. If a spatial 
direction is curled up into a circle, closed strings wrapped around the circle cannot be 
shrunk away. Space is then not simply connected, and the coordinate along the circle is 
not single valued. For such a coordinate, the periodicity condition (13.9) must be modified 
(see Chapter 17). 



We will now show that the periodicity condition (13.9) induces a small but significant 
constraint that relates the left-moving and the right-moving waves. Let us define two new 
variables, 



u = x + a , 
v — r — a . 



(13.10) 



In terms of these variables, equation (13.6) becomes 

X* = Xfru) + X»(v) . (13.11) 

When a — »• a + 2n , the variables u and v increase and decrease by 2n, respectively. As a 
result, the periodicity condition (13.9) gives 

X£(u) + X^(v) = X£(u + 2n) + X^ R (v - 2 7t) , (13.12) 

or, equivalently, 

X£(u + 2 jt) - X%(u) = X%(v) - X^(v - 2 jt) . (13.13) 

This equation establishes that the left-moving and right-moving waves are in fact depen- 
dent on each other: if one fails to be periodic, the other has to fail by the same amount. 
Since n and v are independent variables, both the u derivative of the right-hand side and 
the v derivative of the left-hand side must vanish. As a consequence, we find that both 
X'l (u) and X R (v) are strictly periodic functions with period 2tt (for functions of a single 
variable primes denote derivatives with respect to the argument). We can therefore write 
the mode expansions 

x R^ = \lwT,< e ~ inv ■ (13 - 14) 

neZ 

A set of barred a modes was introduced for the expansion of X^' (it). Even though they 
are written identically, the unbarred a modes used in the expansion of X R (v) have no 
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relation to the open string modes of Chapter 12. In closed string theory we need two sets 
of a modes, barred and unbarred. We integrate equations (13.14) to find 



R/jL 



/ a 7 

+ V 2 


- n 

“o 


. /a 7 ^ V-n 

u+, h X-' 






nji 0 


[o^ 

+ V 2 


\1 

«0 


. /a 7 

V 2 L — ' n 






n^O 


and . 


x^ lL have appeared as 



( 13 . 15 ) 



These are somewhat puzzling, after all, in open string theory there was a single coordinate 
zero mode whose canonical conjugate was the momentum of the string. We will see that 
only the sum of the two zero modes plays a role here. If the space is not simply connected, 
however, each of the coordinate zero modes plays a role, as we will see in Chapter 17. 

The aperiodicity of and of is a consequence of the linear terms appearing in 
(13.15). Condition (13.13) constrains these terms giving 




and therefore 



( 13 . 16 ) 




( 13 . 17 ) 



Owing to this equality, quantum closed string theory has only one momentum operator. As 
we will soon see, this means that canonical quantization works consistently with only one 
coordinate zero mode operator. 



We can now assemble the mode expansion for X^(r, a) by substituting (13.15) into (13.6): 

X"(r. „) = +/!«;( t + „) + iM J2 

njt 0 



1 



With q!q = cHq , we find 



- «£(r -v) + ij ^Y^e-inir-*) 
9 (O V 2 ^ n 

0 



( 13 . 18 ) 



X m (t, ct) = ~(x^ + x^ 1 ) + V2a J a£ x + i^J — ^ — — (<e" !cr + ma ) . 

n^O n 

( 13 . 19 ) 

As expected, X 11 is a periodic function of a with period 2n . The canonically conjugate 
momentum density is 



V z ^(r, CT) = a) = +■••), 



2 not’ 



( 13 . 20 ) 
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where the dots represent the terms in X 1 ' that integrate to zero over the interval a e [0, 2tt] 
and therefore do not contribute to the evaluation of the total momentum: 



r2n 

?** = / ■ 

Jo 



V r ^(T, a )da = 



1 r l7T 

= / da 

2na' Jo 



.a' aiJ = 



la. 

a 



(13.21) 



Thus we have the relation 




This differs from the corresponding open string result (12.46) by a factor of two, but the 
idea is the same: a q is proportional to the spacetime momentum carried by the string. 



There is only one momentum variable, and thus in the quantum theory there is only one 
momentum operator. We should also have only one conjugate coordinate zero mode. Thus, 
despite our left-right decomposition of the solution to the wave equation, xfr and x^ cannot 
both be independent variables. Only their sum appears in (13.19), so it must be the sum 
that is the relevant coordinate zero mode. Without loss of generality, we set 



x 



L/i 

0 




0 ’ 



(13.23) 



and equation (13.19) takes its final form: 



X M (r, a) = x q + V2q! , o!q r + i 



t£ : 

njt 0 



-« e 



U i«<T 



r ). (13.24) 



It is convenient at this stage to record the r and a derivatives of the coordinates. With 
the help of (13.6) we note that 

X* = x + a) + X£'(r - a ) , 

, , (13.25) 

X* = Xf (r + a) - X%(x - a) . 

Adding and subtracting these equations, and using (13.14), we find 



X M + X^' = 2X^\x + a) = V2a' e ~ in(z+a) , 

ne Z 

X V - X^' = 2X^(r -a) = J2d ■ 



(13.26) 



Note that the barred oscillators do not mix with the unbarred oscillators in these combi- 
nations of derivatives. We have tailored the normalization constants to arrive at the above 
relations. They are completely analogous to the open string expansions (12.33). This will 
allow us to obtain some closed string commutators without doing any new computations. 
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Let us now turn to the quantization of the closed string theory. The canonical commuta- 
tion relations take the same form as in open string theory. For the transverse light-cone 
coordinates and momenta we set 

[x 7 (r, a) , V Tj (r , ct')] = iS(a - a')r, IJ , (13.27) 

and, as usual, we set to zero the commutator of coordinates with coordinates and the com- 
mutator of momenta with momenta. For zero modes, we also have [xT , p + ] = —i. Since 
the commutation relations did not change, equations (12.30) and (12.31) are valid here. 
The first of these is 

[ ( X 7 ± X 7 ')(t, a) , (X J ± X j ')(t , a') ] = ±4 tux' ir] IJ —S(a - a') . (13.28) 

da 

In fact, this time the situation is simpler. Equations (13.28) hold for a, a' e [0, 27r] since 
the string coordinates are defined for this full interval. Moreover, the mode expansions 
(13.26) also hold for the full interval. Since the combinations of derivatives take the same 
exact form as they did for open strings, the above equation leads to identical looking com- 
mutation relations. The oscillators, however, are barred when we use the top sign and 
unbarred when we use the lower sign. The result is therefore 







— Wl &m+n , 0 *1 5 








— Wl &m+n , 0 ‘H 


(13.29) 



On account of the expansions (13.26), we call the a operators left-moving operators, and 
we call the a operators right-moving operators. Each of these sets matches the operator 
content of an open string theory. The commutation relations also take the form we would 
expect from open string theory. Closed string theory thus has the operator content of two 
copies of open string theory, except for zero modes. The momentum zero modes are equal 
(ag — a g), and there is only one set of coordinate zero modes x^, Xq . 

Equation (12.31) states that combinations of derivatives with opposite signs commute. 
In the present case this leads to the result that left-moving and right-moving oscillators 
commute: 



[a' m ,a J n ] =0. (13.30) 

We can define canonical creation and annihilation operators just as we did for open strings: 

a 7 = a l n y/n and a!_ n = a ^ ~J7i , n > 1 , 

ci 7 = a 7 yfn and a I _ n = a 7t yfn , n > 1 . 



(13.31) 
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The nonvanishing commutation relations are then the expected ones: 



[ 5 m = 



= S„ 



JJ 



[ a !n 






= s„ 



,n 



IJ 



(13.32) 



The commutators that involve can be found following steps analogous to those used 
for open strings. This time (Problem 13.1) we find that [jCq, a 2 ] and [xq , a 2 ] vanish when 
n ^ 0, and 



[ x o ’ “o — [ 



— 1 x o ’ a o 




(13.33) 



where the expression to the right arises because of (13.22). 

What is the light-cone closed string Hamiltonian? We know that p~ generates X + 
translations, and that, in addition, X + = a'p + r. As a result, d T — a' p + dx+, and the 
Hamiltonian must be given by 

H = a'p + p~ . (13.34) 



In order to find the normal-ordered version of this Hamiltonian, we now turn to the 
transverse Virasoro operators of closed string theory. 



13.2 Closed string Virasoro operators 



We learned in Chapter 12 that the open string transverse Virasoro operators are essentially 
the modes a~ of the light-cone coordinate X~ . For closed string coordinates we have two 
types of modes, barred and unbarred. This is also true for the closed string X~ coordinates: 
we have a~ and a~ modes, and therefore we expect to have two sets of Virasoro operators. 
On account of (13.17), however, we have a ( ~ = , so a surprise awaits us with regards to 

the Virasoro operators with mode number zero. 



To begin our analysis we need an expression that relates X to the transverse coordinates. 
The requisite formula is (9.65), with = 1, as appropriate for closed strings: 

X~ ±X~' = --^—{X 1 ±X 1 ') 2 . (13.35) 

a' 2 p + 

We define Virasoro operators following the pattern in equation (9.79): 



(X 1 + X 1 '? = 4a' £(± £ ay n _ p )e~"^ 

neJj peJj 

(X 1 - X 1 ') 2 = 4a' E <* I p‘Xn- p y in(t - a) 

neZ peZ 



4a' J2 L^e- in(z+a) 
neZ 

4a' L^e~ in(T - a) 
neZ 



(13.36) 
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In each of the above lines, the equality requires a small calculation using (13.26), and the 
second relation is a definition. More explicitly, 

Ln = \ Y a p a l-p ’ L n = \ Y a p a n-p ■ d 3 ' 37 ) 

peZ peZ 

These are the two sets of Virasoro operators of closed string theory. Plugging the definitions 
in (13.36) back into (13.35) we obtain 

X~ + X~' = e~ in(T+(7) , X~ - X~' = \ Y e“" !(r “ ff) . (13.38) 

p+ ' p+ ' 

y neZ r ne Z 

On the other hand, the derivatives of X ~ , as those of any other closed string coordinate, 
can be expanded along the lines of (13.26) to give 

X~ + X~' = Y K e~ in{T+a) , X~ - X~' = 72a 7 Y < e“''" (r_cr) . (13.39) 



We compare equations (13.38) and (13.39) to read the expressions for the minus oscillators: 



y/la' a n = 





(13.40) 



For n — 0, however, there is a constraint. Since a () — a Q , we have the level-matching 
condition 



4 = ^0- ( 13 - 41 ) 

If you look at the definitions of /) ( | and in (13.37), you will realize that these two 
operators are clearly very different from each other. What does it mean that they must 
be equal? Since operators are ultimately defined by their action on states, the meaning of 
the equality (13.41) is that any state |k, X) of the closed string must satisfy L^\X, X) = 
L X , X). This is therefore a constraint on the state space of the theory: “states” that do not 
satisfy this constraint do not in fact belong to the state space. 

To fix the ordering ambiguities in the operators and L {) we define them to be ordered 
operators without any additional constants: 

Z£=^V + * X . 4 =jP I P 1 + N ± . (13.42) 

Here N 1 - and /V 1 are the number operators that are associated with the barred and un- 
barred operators, respectively: 

OO OO 

= Y n ~ a n ~ a 'n ’ A l^ = Y na n a n- ( 13 ' 43 ) 

n = 1 n = 1 

While we will not go through the trouble of proving it, the critical dimension for closed 
strings turns out to be D — 26. This follows from the requirement that the quantum theory 
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be Lorentz invariant. It is no coincidence that the critical dimension for closed strings coin- 
cides with the critical dimension for open strings. It means that both types of strings can 
coexist. In fact, since open strings can, in general, close to form closed strings, it would 
have been quite strange if the critical dimensions did not agree. 

The constant ambiguities due to the ordering of and are also fixed by the condition 
of Lorentz invariance, just as it happened for open strings. The answer could be anticipated, 
since the left and right sectors of closed string theory behave like open strings. In addition, 
the naive argument based on zeta functions suggests that the ordering constants for and 
Lq are the same and equal to that for the L f j operator of the open string. These constants 
are included in the relation between <Xq and L {) and in the corresponding barred relation. 
Therefore equations (13.40), for n = 0, become 

VWa~ = -^(Lq - 1) , VWu~ = A (L ^ _ i) . (13.44) 

The level-matching constraint = L^, which emerged from a ( ~ = a^, remains 
unchanged by the constant shifts. Using (13.42), this constraint can be written more 
simply as 

N ± = N ± . (13.45) 

Averaging the two expressions for a ( J in (13.44), we can find a symmetric expression: 

V2^a~ = ~^(L^ + -2) =a'p~ , (13.46) 

where the relation to p~ follows from (13.22). With p~ known, we can calculate the 
mass-squared: 

M 2 = -p 2 = 2 p+p- - p'p 1 = 4(^0 + ^0 - 2) - p'p' ■ (13-47) 

or 

Substituting the values of Zq and given in (13.42) yields 

2 

M 2 = — ( AH- + 1\H- - 2 ) . (13.48) 

a' 



This is the mass formula for closed string states. The closed string Hamiltonian (13.34) can 
be written in terms of Virasoro operators using (13.46). The result is very simple: 

H =a'p + p~ = + Lfr - 2. (13.49) 

This Hamiltonian is the sum of an “open string” Hamiltonian — 1 for the right-moving 
operators, and an “open string” Hamiltonian — 1 for the left-moving operators. Using 
(13.42) the Hamiltonian can be written as 
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//=^- /) y + iV i + JV i -2. (13.50) 

Both the 4 operators and the 4 operators satisfy the Virasoro algebra (12.133). In addi- 
tion, the commutators between barred and unbarred Virasoro operators vanish. So, the full 
set of closed string Virasoro operators define two commuting Virasoro algebras. 



We conclude this section with a study of the Virasoro action on closed string coordinates. 
The commutation of the closed string Virasoro operators with the oscillators follows the 
pattern of equation (12.1 18). We have 



[ 4 . «n ] = ~ n «m+n - [ 4 > a n ] 



-« «m+n ' 



and, in addition. 



[4>«n] = [ 4 , a{ 



= 0. 



On the other hand, both L ^ and 4 have a nontrivial commutator with : 



(13.51) 

(13.52) 



Quick calculation 13.1 Verify that 




(13.53) 



Let us focus here only on the action of 4 and on the string coordinates. The required 
formulae are obtained in the following exercise: 



Quick calculation 13.2 Verify that 

[4,A 7 (t,<t)] =- l -(X I + X 1 '), [zy,X 7 (t,a)] =- l -(X I -X 1 '). (13.54) 



Adding the two equations in (13.54), we find 

f i - , , i dX 1 

[4 + 4, X 7 (r,a)J = -i— . (13.55) 

This equation is consistent with the Heisenberg equation of motion for A 7 , since the closed 
string Hamiltonian is (4 + 4 — 2). Subtracting the two equations in (13.54), we find a 
more surprising result: 



[ 




A 7 (t ,cr) 



dX 1 

i . 

3 a 



(13.56) 



This equation shows that 4 ~ 4 g enerates constant translations along the string. Indeed, 
for infinitesimal e, 

X 1 (t, a) + [-ie(4 - 4) ’ x * ( r ’ CT )] = X 1 (r, a + e) . (13.57) 



More generally, a finite translation along the string can be obtained by acting on the string 
coordinate with exponentials of 4 ~ 4' Writing 

Z> = 4-4, (13.58) 
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we find that (Problem 13.3) 

e ~ iPrro X I (r,a) e iPa ° = x\x,a + er 0 ) , (13.59) 

for any finite ero. For ero = e, infinitesimal, this general result reduces to (13.57). The oper- 
ator P generates the one reparameterization symmetry that cannot be fixed even in the 
light-cone gauge. Since P annihilates all closed string states (see (13.41)), we conclude 
that closed string states are invariant under rigid a translations. By this we mean that 
exp(— i _P(7o)l'I / ) = I'P), for any closed string state |'P). 

In the two-dimensional (r, er) parameter space of the closed string world-sheet, the 
operator L {) + L {) is a generator of r translations. It is therefore a world-sheet energy. 
Since the gauge condition relates r to the light-cone time, this world-sheet energy turns 
out to give us the spacetime Hamiltonian, the generator of light-cone time evolution. The 
other combination L q — P ( J = P generates translations along the world-sheet coordinate 
a . It can therefore be viewed as a world-sheet momentum. This momentum should not be 
confused with the spacetime momentum of the string. For closed string states the world- 
sheet momentum must in fact vanish, and this is a nontrivial constraint. States with nonzero 
momentum along a can be built, but they do not belong to the closed string state space. 



13.3 Closed string state space 



We are now ready to build the state space of the quantum closed string. The ground states 
are \p + . pr) and they are annihilated by both the left-moving and the right-moving anni- 
hilation operators. To generate all of the basis states we must act on the ground states with 
the creation operators ah ' and ah'. The general candidate basis vector is 



co 25 



oo 25 



IA-.A.) = 



[niw^Hniw 



Ip- pt ) ■ 



(13.60) 



n = 1 1=2 



m= 1 7=2 



Just as with open strings, the occupation numbers X n j and X n j are non-negative integers. 
The number operators act on |A, X) with eigenvalues 

co 25 co 25 

N ± = J2J2 nX "’ I ' N ± =J2J2 mlm ’ J - (13 - 61) 

n= 1 7=2 m= 1 7=2 

Except for the momentum labels, the above states are those that one would obtain by 
combining multiplicatively arbitrary states built from the left-moving and from the right- 
moving operators (compare with (12.162)). Not all of the states in (13.60) belong to the 
closed string state space. The constraint (13.45) must be satisfied by the true states of 
the theory. A basis vector | A., X) belongs to the state space if and only if it satisfies the 
level-matching constraint 



N 1 - — N 1 - . 



(13.62) 
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r 

Table 13.1 




The states with N 1 - 


+ N 1 - < 2 in the closed string spectrum 




iV- 1 , N 1 - 
0.0 
1, 1 


M ) 

\p + , pt) 

a('d J ^\p + , p T ) 


\a'M 2 

-2 

0 


Number of states 

1 

(D - 2) 2 


Wavefunction 

p+, p T ) 

fiji.*, P + > pt) 



This constraint cannot be “solved”. It is clear that eliminating some oscillator from the 
list of operators that can act on the ground states is no help whatsoever. A better strategy 
would be to select a particular oscillator and attempt to use it, as many times as needed, to 
fix any state that does not satisfy (13.62). Not even this is possible, because any oscillator 
can only contribute a positive amount to the number operator. The constraint (13.62) must 
be implemented in a case-by-case fashion. The masses of the states are obtained from 
(13.48): 

^aM 2 = N J ~ + N ± - 2. (13.63) 

Let us identify the first few basis states, give their masses, and explain what fields they 
represent. The results are tabulated in Table 13.1. 

The ground states in the first row of Table 13.1 are the one-particle states of a quantum 
scalar field. For such states N 1 - = N 1 - — 0 and M 2 = —Ala! < 0, so these are closed 
string tachyons; they are in fact completely analogous to the tachyons of open string the- 
ory. The mass-squared of the closed string tachyon is four times larger than that of the 
open string tachyon. The closed string tachyon is far less understood than the open string 
tachyon. In particular, the closed string tachyon potential has not yet been calculated reli- 
ably. The instabilities associated with closed string tachyons are expected to be instabilities 
of spacetime itself. They remain largely mysterious. 

The next excited states must be built with two oscillators acting on the ground states. This 
is because we must satisfy the constraint N 1 - = N 1 . One oscillator must be from the left- 
sector and one from the right-sector, both with the lowest possible mode number - mode 
number one. This gives the states described in the second line of the table. All these states 
have M 2 = 0, so they are of great interest. Since I and J are completely arbitrary labels 
attached to different oscillators, the number of states is ( D — 2) 2 . 

Let us consider the general state of fixed momentum at the massless level. We write it as 

T. RiJ Pt) ■ ( 13 . 64 ) 

i,J 

Here Rjj are the elements of an arbitrary square matrix of size ( D — 2). Any square matrix 
can be decomposed into its symmetric part and its antisymmetric part: 



RiJ = - ( RiJ + Rji) + - (RiJ - Rji ) = Sij + Au , 



(13.65) 
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where Sjj and A jj are the symmetric and antisymmetric parts of R u, respectively. The 
symmetric part Sjj can in fact be decomposed further: 

SiJ = (Sn --^—^8 I jS) + -^—^8ijS, S = S U = & IJ Su. (13.66) 

The first term on the right-hand side is traceless: 

S IJ (Su - s) = S- S,J 8 IJ S = 0, (13.67) 

since 8jj 8 IJ = D — 2. Therefore (13.66) decomposes .S’/./ into a traceless matrix plus a 
multiple of the unit matrix. Let Sj j denote the traceless part of 57 j and let 5' = S/(D — 2). 
All in all, we have decomposed Ru as 

Rij = S,j + A u + S'8 U . (13.68) 

This is the standard decomposition of a matrix into a symmetric-traceless part, an antisym- 
metric part, and a trace part. Each of the three pieces can be specified independently when 
writing a general matrix Rij. Therefore, the states in (13.64) can be split into three groups 
of linearly independent states: 



E5//afa/V, PT ) , 


(13.69) 


i,J 




E^/y«r«fV + i pt) , 


(13.70) 


i,j 




S 1 I p + , pt) ■ 


(13.71) 



We now make a remarkable claim: the states (13.69) represent one-particle graviton 
states! We examined one-particle graviton states in Section 10.6. In the quantum theory of 
the free gravitational field these states are given by ( 10 . 1 10 ): 

D— 1 

E v!/> r i Q >’ (13 - 72) 

/, 7=2 

where 7/,/ is an arbitrary symmetric traceless matrix. Since 5/ / is also a symmet- 
ric traceless matrix, the identification of states is possible if we identify the basis 
states: 

a?a{\p+,p T ) <-► a l fi pr |£2). (13.73) 

This identification is possible because the two sets of states have the same Lorentz labels, 
they carry the same momentum, and they have the same mass (both zero). This shows that 
the closed string has graviton states. Gravity has appeared in string theory! We never put 
in a dynamical metric and we never spoke about general covariance, yet somehow, the 
quantum states of the gravitational field have emerged! 

The set of states in (13.70) corresponds to the one-particle states of the Kalb-Ramond 
field, an antisymmetric tensor field B^ v with two indices. The light-cone analysis of this 
field was discussed in Problem 10.6 (see, in particular, parts (e) and (f)). The B llv field is 
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in many ways the tensor generalization of the Maxwell gauge field A ^ . The Kalb-Ramond 
field couples to strings in a way that is analogous to the way that the Maxwell field couples 
to particles. Thus, as we will see in Chapter 16, strings carry Kalb-Ramond charge. 



There is one state that remains to be examined. The oscillator part of (13.71) has no free 
indices (/is summed over), so it represents one state. It corresponds to a one-particle state 
of a massless scalar field. This field is called the clilaton. 

The above discussion of particle states is supplemented by an analysis of wavefunctions 
and field equations. Such analysis follows closely the treatment in Section 12.7. Wavefunc- 
tions i// / / ( r , p + , pj-) describe the general time-dependent states at the massless level of 
the closed string: 

I'h, r) = J dp + dp T P + , PT)a{ ] a( p + , p T ) ■ (13.74) 

The Schrodinger equation satisfied by the states is i d T | T, r) = //IT', r). Using (13.50) 
and noting that for the states in question /V = /V L = 1, we find 



■ d fu a ' K K , 

i —, — = -r P P V' // • 
3r 2 



(13.75) 



The wavefunctions i[ru(r, p + , pj) become the fields of the classical field theories, with 
the Schrodinger equations interpreted as classical field equations. The symmetric-traceless 
part of i j/i j becomes the graviton field, the antisymmetric part becomes the Kalb-Ramond 
field, and the trace part becomes the dilaton field. The Schrodinger equations for the wave- 
functions of the graviton states, the Kalb-Ramond states, and the dilaton state, are all 
included in (13.75), and can be separated by selecting the symmetric-traceless compo- 
nents, the antisymmetric components, and the trace component of x/fij. On the other hand, 
in light-cone coordinates the massless scalar field equation 3 2 cp = 0 takes the form (10.30): 



(i ^ P K p K ^<p(x + , p + , p T ) = 0. (13.76) 

Setting x + = a'p + r, we find 

(i P + ,Pt) = 0. (13.77) 

This equation takes the same form as (13.75). In fact, in the light-cone gauge, graviton 
fields, Kalb-Ramond fields, and the dilaton field, all satisfy the simple equation 3 2 (p" = 0, 
where the dots refer to the relevant indices. This is manifestly true for the massless dilaton, 
which is a scalar. For graviton fields, it was demonstrated in equation (10.107). For Kalb- 
Ramond fields, the equation p 2 B llv = 0 was shown to hold in the light-cone gauge as part 
of the analysis in Problem 10.6. 



In summary, at the massless level of the closed string we found gravity fields, Kalb- 
Ramond fields, and dilaton fields. Each of these fields deserves intense study. The gravity 
field is studied in general relativity. We will focus on the Kalb-Ramond field in Chapter 16. 
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The dilaton is a massless scalar field with surprising properties. A proper study of the dila- 
ton belongs to an advanced course, but in the following section we discuss the role that it 
plays in string theory. 



13.4 String coupling and the dilaton 



The massless scalar field called the dilaton has a fascinating property: its expectation value 
controls the string coupling! This coupling is a dimensionless number that sets the strength 
of string interactions. 



The classic example of a coupling constant is the electromagnetic fine-structure constant 
a = e 1 /(Arche) — 1/137. This dimensionless coupling constant controls the strength of 
electromagnetic interactions. In the hydrogen atom Hamiltonian, for example, a appears 
in the term which gives the electrostatic interaction energy between the proton and the 
electron. The mass-scale in the hydrogen atom is set by the mass m of the electron. Phys- 
ical, dimensionful quantities, such as the binding energy E of the ground state, depend on 
the dimensionful parameter m of the theory, the fundamental constants h and c, and the 
dimensionless coupling a: 




4-7T 2«o 



— Y 

2\4jthcJ 



me 




(me 2 ) , 



(13.78) 



where ao — Arch 2 /me 2 is the Bohr radius. In the hypothetical limit a -> 0, the binding 
energy vanishes, and the Bohr radius is infinite. This is what becomes of the hydrogen 
atom when we turn off electromagnetic interactions. 



In string theory the story is not so different, at first. The dimensionful parameter can be 
taken to be a', which defines the string length l s = Vcd (working with h = c = 1). Let g 
denote the dimensionless coupling for closed string interactions. If g was set to zero, then 
strings would not interact. Interactions in gravity are determined by the value of Newton’s 
gravitational constant. If closed strings do not interact, gravitation would emerge without 
interactions, and the value of Newton’s constant in string theory would be zero. Since it 
vanishes when g -> 0, Newton’s constant is naturally expected to be proportional to some 
positive power of g; it turns out that it is proportional to g 2 . Dimensional analysis fixes 
the a' dependence of Newton’s constant. Equation (3.108) shows that the D-dimensional 
Newton constant G 1 11 1 has (natural) units of L D ~ 2 . In fact, in natural units, G (,y> is equal to 
the D-dimensional Planck length l 1 to the power (D — 2). So [G - 26) ] = L 24 , and since 
|V] = L 2 , we find 

G (26) ~ g 2 (a') n . (13.79) 

Most phenomenological studies of string theory begin with ten-dimensional superstring 
theories. These theories contain both bosonic and fermionic excitations, so they include 
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the two types of particles that we observe in nature. The ten-dimensional Newton constant 
G (10) in superstring theory is given by 

G (10) = (4 10) ) 8 ~ g 2 (a') 4 . (13.80) 

It then follows that 

4 10) ~g 1/4 V^ = g 1/4 ^. (13.81) 



If the string coupling g is a small number, the string length is larger than the Planck 
length. If g is of order unity, the string length and the Planck length are comparable. 
To find four-dimensional implications, we use the relation (3.116) between higher- and 
lower-dimensional Newton constants in a compactification. Assume that six of the dimen- 
sions of the ten-dimensional world are curled up into a space with volume V ((y> . Then, the 
four-dimensional Newton constant G is related to the ten-dimensional one by 



r = ^ , 1 

y(6) 8 01 y(6)/( a ')3 ■ 



(13.82) 



The ratio V (<,> /(a/) 1 ' is a dimensionless number that is typically assumed to be large. For 
a compactification of fixed volume, as measured in units of the string length, the four- 
dimensional Newton constant behaves like 



G 



g 2 a’ 



(13.83) 



In a theory with both open and closed strings, the open string coupling g a is actually 
determined in terms of the closed string coupling g. One can prove that 

ti-g- (13-84) 

This relation arises due to certain topological properties of two-dimensional world-sheets. 



The coupling which controls the strength of an interaction may sometimes fail to be 
constant. Consider adding to a free Hamiltonian Hq an interaction term gH ml that is pro- 
portional to a dimensionless coupling g. If g is declared to be a constant, you must specify 
its value by hand in order to define the complete Hamiltonian Hq + g// mt . But suppose 
that g is not a constant but rather a dynamical variable g(t), and that the full Hamiltonian 
includes an additional term H g that gives dynamics to g. In this case you cannot specify 
the coupling g arbitrarily by hand. The coupling would be determined, perhaps uniquely, 
or perhaps not, by the Hamiltonian equations derived from Ho + gH m t + H g . If g{t) is 
uniquely determined, no choice is needed. If g(t) is not uniquely determined, some other 
criterion may be needed to select the physically realized solution. 



The situation in string theory is similar; the string coupling can fail to be a constant. The 
closed string coupling g is determined by the value of the dilaton field <f> (x ) : 



g 



(13.85) 



It follows that, in principle, the string coupling g is not an adjustable parameter of string 
theory. Rather, it is a dynamical parameter - a field, in fact. This is an ideal property in a 
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unified theory of all interactions, for it holds the promise that the string coupling may be 
calculable. On the other hand, it seems clear that the string coupling is not uniquely deter- 
mined in string theory. Depending on the values taken by the other fields in the theory, the 
dilaton field may evolve in different ways. Under certain circumstances, the dilaton expec- 
tation value may even be an adjustable constant. One attractive possibility is that the other 
fields in the theory generate a potential for the dilaton. If this potential has a stable critical 
point the dilaton may be set equal to the critical value. Even more, the dilaton field would 
then acquire a mass. This is necessary for a realistic model of physics because there are no 
known massless scalars in nature. 

If the string coupling is small, the quantum mechanical amplitudes for string interactions 
can be calculated accurately using known results about Riemann surfaces. The fascinat- 
ing properties of Riemann surfaces allow one to understand why the infinities that plague 
quantum amplitudes in general relativity do not appear in string theory. This subject will 
be discussed in Chapter 26. 



13.5 Closed strings on the R 1 /^ orbifold 



We introduced orbifolds in §2.8 and stated that they are nontrivial spaces on which string 
propagation is tractable. In this and the following section we demonstrate this claim for 
the case of closed strings on the simplest orbifold, the half-line R 1 /Z 2 . Since we must 
deal with the full quantum string theory, the orbifold direction is just one direction in the 
26-dimensional spacetime. This direction x 25 = x, is effectively restricted to x > 0 by the 
1.2 identification 

x ~ — x . (13.86) 

No other coordinate is affected. The orbifold theory does not require boundary conditions 
at x — 0. The orbifold theory is defined by imposing a natural restriction on states of closed 
strings that live on the spacetime before orbifolding - a restriction on the spectrum of the 
original parent theory. 

Writing X 25 (r, a) = X (r. a), the collection of string coordinates is X + . X~ , X' , with 
i = 2, . . ., 24, and X. We introduce an operator U that implements on the string coor- 
dinates the Z 2 transfomation that defines the orbifold. Thus, acting on X we must 
have 

UX{x,g)U~ { = -X{x,o) . (13.87) 

Since U should not transform any other string coordinate we demand 

UX‘(t, a)U~ l = X ! '(t,ct). (13.88) 

Moreover, we demand the invariance of p + and x ( ~ : 

Up + U~ 1 = p + , and UxqU~ 1 =Xq, 



(13.89) 
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+ 



A closed string state invariant under x -»• -x is obtained by superposition of two states 
that go into each other as x ^ -x. 

since these, together with (13.87) and (13.88), imply the invariance of X + and X~: 

UX ± {x,o)U~ l = X ± (t,(t). (13.90) 

Indeed, the invariance of X + = a' p + r follows directly from the invariance of p + . The 
invariance of X~ holds because it is the sum of the invariant mode x(J~, terms quadratic 
in the invariant coordinates, and terms quadratic in X, X’ (see (13.35)). It follows that p~ 
is invariant under the U action. Since /; + is also invariant, the Hamiltonian H = a' p + p~ 
(see (13.49)) is invariant: UHU~ l = H. This means that U is a symmetry of the closed 
string theory. 

The orbifold closed string theory keeps only U -invariant states of the parent theory. Since 
U is a symmetry, the truncation to U -invariant states is a consistent reduction: if the Hamil- 
tonian were not U invariant, states that are U invariant at one time need not remain U 
invariant for all times. Intuitively, for [/-invariant string states, the physics at — x is deter- 
mined by the physics at x, thus effectively making half of the space irrelevant. U -invariant 
states are naturally constructed by quantum superposition (Figure 13.1). 

To implement the restriction to U -invariant states it is convenient to determine the action 
of U on the oscillators. The coordinate X(r, er) has the usual mode expansion (13.24) of a 
closed string coordinate 

X(t, er) = x 0 + a! px + iJ °^- ^ e —— (u„e' na + a„e _!ncr ) . (13.91) 

Z njtO 

If we are to have equation (13.87) for all values of r and a we must have 
U xo U~ l — — xo , U p U~ l — —p , 

(13.92) 

U a„ U = —a„ , U a n U = —a n . 

All operators in the expansion of X change sign under the action of U. All modes in the 
expansion of the coordinates X 1 are left invariant by the action of U. 

Quick calculation 13.3 Justify UHU~ l = H directly from the oscillator expansion of 
H = Ljj- + Lq — 2. 

Let us now discuss the states of the theory, beginning with the ground states. We denote 
the ground states of the parent theory by | p + , p, p), where p is a vector with components 







298 



Relativistic quantum closed strings 



p' and p denotes the momentum in the x direction. We also assume that states that have 
no momentum in the x direction are invariant under the action of U : 



Being invariant under U, the state \p + , p, 0) is a ground state of the orbifold theory, but 
there are many more. To find these, we first derive the U action on /; + , p. p). We assume 
that for every pair of conjugate coordinate and momentum operators ( q , p) momentum 
states are defined to satisfy | p + Sp) = exp (iSp q)\p), with Sp a constant. Applied to our 
pair (xo, p), we find |p + , p , p) = e IX0P \p + , p, 0) and, therefore, 



U\p + ,p, p) = U e ix ° p U~ x U\p + , p, 0) =e i( ~ Xo)p \p + ,p,0) = \p + ,p,-p), (13.94) 



More precisely, orbifold ground states are general time-dependent superpositions of the 
above states: 

J i/c(r, p + , p, p) \p + , p, p)dp + dp dp, f(T,p + ,p,-p) = i/f(T,p + ,p,p). 



The Fourier transformed ground state wavefunctions t/t(t, p + , p. x) are even functions of 
x. The massless states of the orbifold theory require N 1 - = /V 1 = 1. Indeed, orbifolding 
does not change the mode expansion of X and the formula for M 2 is not changed. To build 
the states we need two oscillators, one barred one unbarred, acting on suitable ground 
states. Basis massless states are given by 



All these states are U -invariant. For those that have an odd number of X oscillators we use 
a combination of vacuum states with U = — 1 so that the full state has U = + 1 . Massive 
states can be constructed similarly. It would seem that this is the end of the story, but there 
is a surprise. The orbifold theory includes more states than the ones discussed above. It con- 
tains a twisted sector with a new kind of closed strings. Let us examine these strings now. 



U\p + ,p,0) = \p + ,p,0). 



(13.93) 




(13.95) 



(13.96) 



(13.97) 



(lP + - P > P) + I F + , P, ~p}), 
d-lOt - 1 (|p + , p, p) - |p + , p, -p)), 
«-i ct l _ x (|p + , p, p) - |p + , p, -p)), 

a-ia-i (I P + , p, p) + I p + , p, ~p)). 



13.6 The twisted sector of the orbifold 



The additional closed strings that appear in the so-called twisted sector can be imagined as 
open strings in the parent theory, but with endpoints at locations identified by the orbifold 
condition (13.86). More precisely, we write 
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A "twisted" closed string satisfies X(t, a + 2k) = -X(t, a). The string for a e [0, 2k) is 
shown as a continuous line and the part a e [2jt, 4jt] as a dashed line. The vertical 
coordinate represents any x'. 



X(r, a +2tt) = -X(r, a). (T3.98) 

This equation tells us that X(r, 2k) — —X(r, 0), which means that the orbifold identifi- 
cation makes them the same point and the string becomes effectively closed. For arbitrary 
a the equation tells us that the string effectively repeats its position when a is increased 
by 2n. In the parent space the string only closes after a -» er + 4k , as is clear from two 
applications of (13.98). A two-dimensional representation of a string obeying (13.98) is 
shown in Figure 13.2. We note that a twisted closed string must go through x = 0. This is 
clear because the continuous function X(r, a) in (13.98) takes both positive and negative 
values. 

To develop the quantum theory of this sector we first find the appropriate oscillator 
expansion for the coordinate X that satisfies (13.98). As usual, to solve the wave equation, 
we write 



X(r, a) = Xl(u) + X R (v ) , u = r + a , v = r — a. (13.99) 

The constraint (13.98) implies that 

X L (u + 2k) + X R (v - 2k) = -X L (u) - X R (v), (13.100) 

or, equivalently, 

X L (u + 2k) + X L (u) = ~(X R (v) + X R (v - 2k)). (13.101) 

Taking derivatives with respect to u and v we find 

X' l (u+2k) = -X' L (u), X' r (v + 2 k) = -X’ r {v). (13.102) 



For an ordinary string coordinate X) and X l R ' are periodic functions with period 2k. For X 
the derivatives reverse sign when the argument changes by 2k . To write a mode expansion 
that is conveniently normalized we mimic (13.14). In order to get a sign change when 
u —> i / + 2k we need exponentials of the form exp (iku) with k half-integer. We therefore 
write 
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n€Z 0 dd 



X' R (V) = ^ U" i e~ i ? v - (13-103) 

fi €:Z 0 dd 

The labels on the oscillators, now half-integers, match the factors of u or v in the 
exponentials. Integrating (13.103) we get 



X L {u) =x L + i 



X R (v) =x R + i 







an e l 2 u 
2 



-a>±e l z v . 
n 2 



(13.104) 



Equation (13.101) now gives xr = —x r, or xr+ xr = 0. Putting the left- and right- 
moving parts together we arrive at the mode expansion for the twisted sector closed 
strings: 

X(r, a) = iJ — -e 1 1 + ane , 5 rT ) . (13.105) 

}1 G^odd 

The expansion contains neither coordinate nor momentum zero modes. It costs energy to 
get away from x — 0 since at least one point on the string must remain at x = 0. As we 
will see later, the fields associated with twisted states are localized at the fixed point x = 0 
and depend only on x + , x ~ , and x l . 



The commutation relations for the modes can be derived in complete analogy to the 
usual case. The coordinate obeys [A(r, a ) , V T (x, (J 1 )] = iS(<j — a'). Moreover, given our 
mode expansions the coordinate X satisfies (13.26) with n n/2 and sums over odd 
numbers: 



X + X' = 2X' L (j+a) = 4la' ^ an e~ i ^ r+cr) , 

«gZ 0( id 

X - X' = 2X'r(t - a) = V2W 7 ^ a««r i a (t_,7) . (13.106) 

rc€<2odd 



Applying the top-sign version of (13.28) to X, we get 



m'.n'eZ odd 



} \ot m ' , 0t„/ 

L T T 



= 2jti — S(a-a'). (13.107) 

da 



To extract the commutators we apply the following integral operators to the left and right 
sides of equation (13.107): 



1 

2; r 



p2n 

Jo 1 



da e' 2 ' 



/» 2tt 

Jo ' 



— I da’ i 

2it 



m, n e Z 0 dd • 



(13.108) 
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' k * k^ 

Two functions e'i a and e l ^ a with k, k' e Z 0 dd and k + k' ^ 0 are orthogonal over the 
interval [0, 2n] because k + k' is an even integer. The integrations in (13.108) therefore 
select a single commutator on the left-hand side, and one can show that 

[«-,«-] = f« m+ »,o. (13.109) 

This is the expected form of the commutation relations. 

Quick calculation 13.4 Prove (13.109). 



Similar commutation relations hold for the right-moving oscillators and, as usual, left- 
moving and right-moving oscillators commute: 



L am, an 

2 2 



m o 

— 2 0 : 




(13.110) 



We must still implement the orbifold identification, so equation (13.87) must hold for 
the twisted X in (13.105). We can readily read off the action of U on the new oscillators: 



UaiU 

2 



-1 



-an 

2 



Uan u 

2 



-1 



-an 

2 



(13.111) 



The absence of a momentum zero mode in the expansion of X means that twisted states 
do not have a conserved momentum along the X direction. U -invariant ground states are 
labeled by p + and the transverse momentum p along the X' : 



twisted sector ground states: | p + , p ) . 



(13.112) 



These are not to be confused with the ground states (13.96) in the “untwisted” sector that 
have zero momentum along X. 

To discuss excited states in the twisted sector we need the relevant formula for a' M 2 . This 
time we can expect changes because the half-integer moding of the X oscillators can alter 
the ordering constant. To find out, we recall that in open string theory we anticipated the 
value of the ordering constant by performing the naive ordering of Lq and using the f- 
function motivated rule 1 + 2 + 3 + ■ ■ ■ — » Let us do the same now for the closed 

string operator L; J-. We see from the top equation in (13.36) that L t) is determined by 
contributions from ( X 1 + X 1 ') 2 , that is, (X ! + X 1 ') 2 + {X + X') 2 . Since the mode expan- 
sions of X 1 + X' 1 and those of X + X' are completely analogous, the formula for in 
(13.37) will be modified to read 

=\j2° ll p° l ‘-p + \ “*«-§■ (13.113) 

ktzffj oc id 
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Each of the twenty-three directions i in the first sum contributes an ordering constant of 
2 ■ (— p ) = — 2 ' 4 . The new contribution arises from ordering the second summand: 

- T ata k = - Y aka a- H — a a a 

9 ' 9 “9 9 9 “9 9 “9 



2 — ' 2 2 

&cZ odd 



ieZ odd 



^z 0 + dd 



^ or a 
"2 2 



1 



= > a_Affi H — > fat ,a_i 

^ — ' 2 2 2 ^ — ' L 2 2 

^ Z odd 



^z+ d 



= > or AO! A + 

' 2 2 4 



i !>• 



(13.114) 



^z+ dd 



^odd 



To evaluate this we need to calculate the “sum” of all the odd positive integers. This is done 
as follows: 

OO OO 

H k = E k+ E k = E k + 2 J2 k. (13.115) 

ir=l 



k=1 *eZ+, ke Z+ en *6Z+ d 



It then follows that 



OO . 

e *=- x >=^ 



(13.116) 



^z+ dd 



k=l 



We can now write the precise quantum form of (13. 1 13): 

I 00 | || 

L o = i“VV + E“-X- 23 '^+ E “-§“1 + 1 -Tv 



P=1 



^odd 



which we write as 



I . i «; _ . 

Lq = -a’ p l p‘ +N 1 - - — , N L = a'_ p a‘ p + E 

P=1 i6 ^odd 



a or A 
2 2 



(13.117) 



(13.118) 



A completely analogous formula holds for . 

Quick calculation 13.5 Show that [A- 1 , a_ 2 ] = % ol_± and explain why A 1 - is properly 
called a number operator. 

To write the mass-squared formula recall that \a' M 2 is equal to the sum of A 1 - plus N 2 - 
plus the ordering constants, as in (13.48). We thus have 

1 

2 



1 , 2 , - , 15 

-a'M~ — . 



(13.119) 



The twisted sector ground states \p + , p) have A ± = A 1 - — 0. They are tachyonic states 
with ^a' M 2 — — y- The first excited states are built using the lowest moded twisted 
oscillators: 



a_ia_i \p , p) , 

2 2 



1 



-a'M 2 = - + 



1 1 15 



2 2 



8 



(13.120) 
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The momentum labels on twisted states indicate that the wavefunctions are of the form 
i/f (r, /; + , p), or i//(r, x~ , x ) in coordinate space. Given the correspondence between wave- 
functions and fields, we conclude that the fields associated with twisted states have no x 
argument. These fields must live at some specific value of x. Given that energetics forces 
twisted states to localize around x — 0, we reasonably conclude that the fields live on the 
reduced 25-dimensional spacetime defined by x — 0. The above ground states and first 
excited states are associated with scalar fields because the states have no indices along 
directions in the reduced spacetime. 

Orbifolds always have a twisted sector thus orbifolding has a double effect - the so-called 
double strike. One effect is that states of the parent theory that are not invariant under the 
orbifold action are projected out. The other effect is that we gain a whole new sector of 
states that satisfy “twisted” boundary conditions. 



Problem 13.1 Commutation relations for oscillators. 

(a) Use the lower-sign version of equation (13.28) and the appropriate mode expansion to 
verify explicitly that the unbarred commutation relations of (13.29) emerge. 

(b) The set of functions e ' nry , with n e Z, is complete on the interval a e [0, 2n]. Use this 
fact to prove that 



(c) Compute explicitly the commutator [X 1 (x, a), V zJ (r, o')] using the mode expansions 
of X and V and the commutation relations (13.29), (13.30), and (13.33). Use equation 
(1) to confirm that the expected answer (13.27) emerges. 

(d) Prove the zero mode commutation relations (13.33), starting with a derivation of 



which is the closed string analog of equation (12.47). 

Problem 13.2 A projector into physical states. 

Consider the vector space H spanned by the set of states /,, X) in equation (13.60). Explain 
why, for any state |A, X) e H. the eigenvalue of P — — L (l is an integer. Show that 



is a projector from Ti into the vector subspace where P — 0. Thus Vo projects into the state 
space of closed strings. 



Problems 




(l) 



^ Xq T V 2a ' r , X J (r, o') 
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Problem 13.3 Action of Lq — Lq . 

(a) Prove that equation (13.59) holds for finite ao . You may find it useful to define f (a o) = 
e ~ lPa o X 1 (r, a) e' Pao and to calculate multiple derivatives of /, evaluated at ero = 0. 

(b) Explain why 

e ~ iPa ° (X 1 ±X I ')(T,o)e iPa ° = (X 1 ±X i ')(t,o + c r 0 ). (1) 

(c) Use equation (1) to calculate e~' Pa ° a f, e' Pa ° and e~ lPa ° a ] n e' Pcr ° . In doing so, you 
are finding the action of a a translation on the oscillators. 

(d) Consider the state 

\U) = a!_ m al„\p + , pr) , m ,n > 0 . 

Use the results of (c) to calculate e~ ,Pa °\U). What is the condition that makes the state 
| U) invariant under a translations? 



Problem 13.4 — L {) as world-sheet momentum. 



(a) Use equations (13.36) to show that 



Also prove that 




T -L _ J 4- 




2jt 



2 7T 



da X 1 X 1 ' . 



da 



dX~ 
3 a 



(1) 



which explains that — L {) vanishes classically because X~ , just like any other 
string coordinate, must satisfy the closed string periodicity condition. 

(b) The dynamics of the transverse light-cone string coordinates is governed by the 
Lagrangian density (12.81): 



c = —(x I x I -x I 'x I '\. 

Ana.' \ / 

Show that the infinitesimal constant a translation SX 1 = e d a X ! is a symmetry of 
C in the sense of Problem 8.10. Calculate the associated charge and show that it is 
proportional to Lq — , as given in (1). 



Problem 13.5 Unoriented closed strings. 

This problem is the closed string version of Problem 12.12. The closed string X 1 ' (r. a) 
with a e [0, 2n] and fixed r is a parameterized closed curve in spacetime. The orientation 
of a string is the direction of increasing a on this curve. 

(a) Consider now the closed string X ii (t. 2n — a) with the same r as above. How is this 
second string related to the first string above? How are their orientations related? Make 
a rough sketch, showing the original string as a continuous line and the second string 
as a dashed line. 
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Introduce an orientation reversing twist operator Cl such that 

Cl X 1 (r, o) Cl~ l = X 1 (r, 2 it — a) . (1) 

Moreover, declare that 

f2X(j“ f2 _1 = x^ , Cl p + f2 -1 = p + . (2) 

(b) Use the closed string oscillator expansion (13.24) to calculate 

£2xq£2~ 4 , Cloth f2 _1 , and Cloth ■ 

(c) Show that A _ (r, er) f2 _1 = X~(r, 2 : r — er). Since £2 X + (r , er) £2 _1 = X + (r, 2jt — 
er), equation (1) actually holds for all string coordinates. We say that orientation 
reversal is a symmetry of closed string theory because it leaves the closed string 
Hamiltonian H invariant: ClHCl~ l = H. Explain why this is true. 

(d) Assume that the ground states are twist invariant. List the closed string states for N 1 - < 
2, and give their twist eigenvalues. If you were commissioned to build a theory of 
unoriented closed strings, which of the states would you have to discard? What are the 
massless fields of unoriented closed string theory? 

Problem 13.6 Orientifold Op-planes. 

An orientifold Op -plane is a a hyperplane with p spatial dimensions, just as a Dp-brane 
has p spatial dimensions. The Op-plane arises when we perform a truncation which keeps 
only the closed string states that are invariant under the symmetry transformation which 
simultaneously reverses the string orientation and reflects the coordinates normal to the 
Op-plane. 

For an Op-plane, let x , . . x p be the directions along the Op-plane, and let 
x p+1 , . . ., x d with d — 25 be the directions orthogonal to the Op-plane. The Op-plane 
position is defined by x a = 0 for a — p + \ ... ,,d . We will organize the string coordi- 
nates as X + , X~ , {Z'J , {X a } with i = 2, . . p, and a = p + 1, . . ., d. Let Cl p denote 
the operator generating the transformation 

Cl p X a {T,a)Cl- x = -X a (j,2n -a), (1) 

Cl p X‘ (t, a) Cl~ x = X' (x, 2tc — a) . (2) 

Moreover, assume that 

Cl p x-Cl- p '=x~, Clp p + Cl~ l = p + . (3) 

(a) For an 023-plane the two normal directions x , x 25 can be represented by a plane. 
A closed string at a fixed r appears as a parameterized closed curve X a (r , er) in this 
plane. Draw such an oriented closed string that lies fully in the first quadrant of the 
(x 24 , x 25 ) plane. Draw also the string X a (r, er) = -T fl (r, 2 jt — a). 
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(b) For any operator O the action of Q p is defined by O — > Q. p OQ. p 1 . Use the expansion 
(13.24) to calculate the action of £l p on the operators 

P ° > <> x 0 ’ P‘> <> and a,',. 

Show that 2f ± (r, er) £2“ 1 = X ± (r, 2 tt — cr). Explain why the orientifold transfor- 
mations are symmetries of closed string theory. 

(c) Denote the ground states by \p + , p l , p a ). Assume that the states \p + ,p',0) are 
invariant under Q p : £l p \p + , p ' , 0 ) = \p + , p' , 0) . Prove that 

tt p \p + ,p', p a ) = I p + ,p',-p a ). 

Assume that, for every pair of conjugate coordinate and momentum operators ( q , p), 
the momentum states are defined to satisfy \p + Sp) = exp(iSp q)\p), with Sp a 
constant. 

The massless states of the oriented closed string theory are given by 

1$) = J dp + dp l dp a p + , p l , p a )(a I _ l u J _ l ±a J _ l a I _^\p + , p l , p a ), 

where <t>^y are wavefunctions, and the transverse light-cone indices /, J run over all the 
values taken by the indices i and a. 

Now consider the truncation to Q p invariant states: |<t>) = |<t>). The resulting string 

theory is the theory in the presence of the orientifold Op-plane. Intuitively, for an invariant 
state the amplitudes for the string to lie along each of the two related curves of part (a) are 
equal. 

(d) Find the conditions that must be satisfied by <J>^, <J>^, to guarantee Q p invariance. 
All of the conditions are of the form 

P + > P'- P°) = ■■■ $ />( T ’ P + > P l ’~P a ) - 
where the dots represent a sign factor that you must determine for each case. 

Remarks: in coordinate space, letting x m = {a- 0 , . . ., a p } , the above invariance conditions 
require that 

<t>fj(x m ,x a )= <t>fj(x m ,-x a ). 

In the presence of an orientifold plane, the values of the fields at (x m , x a ) determine the 
values of the fields at the reflected point (a™ , — x a ). The fields are either even or odd under 
x a — > —a" . The orientifold plane is some kind of mirror that relates the physics at reflected 
points, effectively cutting the space in half. In one half of the space, and away from the 
orientifold, there are no constraints, so one has the full set of fields of oriented closed 
strings. An 025-plane is space filling. Since it has no normal directions, the orientifold 
symmetry includes only string orientation reversal. This case was studied in Problem 13.5. 



r 

14 A look at relativistic superstrings 

L 



Realistic string theories must contain fermionic states, like the states of electrons 
or quarks. Superstrings include anticommuting dynamical variables in addition 
to the commuting coordinates X 11 that describe the position of strings. For open 
superstrings, quantization gives a state space with a Neveu-Schwarz (NS) sector 
that contains bosonic states and a Ramond (R) sector that contains fermionic 
states. The theory has supersymmetry, a symmetry that ensures that the number 
of bosonic and fermionic degrees of freedom are the same at any mass level. We 
examine type II closed string theories, which arise by tensoring the state spaces 
of open superstrings. 



14.1 Introduction 



We have so far studied bosonic string theories, both open and closed. These string theories 
live in 26-dimensional spacetime, and all of their quantum states represent bosonic particle 
states. Among them we found important bosonic particles, such as the photon and the 
graviton. Non-Abelian gauge bosons, needed to transmit the strong and weak forces, also 
arise in bosonic string theory, as we will see in Chapter 15. 

Realistic string theories, however, must also contain the states of fermionic particles. 
You may recall that a quantum state of identical bosonic particles is symmetric under the 
exchange of any two of the particles. A quantum state of identical fermionic particles, on 
the other hand, is antisymmetric under the exchange of any two of the particles. Quarks 
and leptons are fermionic particles. To obtain them we need superstring theories. We will 
not study superstrings in detail in this book. A proper explanation would require a detailed 
discussion of spinors and the Dirac equation in various numbers of dimensions, as well 
as other technicalities. This would take us too long. Here we give you a brief discussion 
of the basics of superstrings, enough to appreciate that they are natural generalizations 
of the bosonic string with some interesting new ingredients. Certain applications that are 
discussed in this book involve superstrings; this chapter provides the required background 
material. 

The superstring spectrum contains no tachyon. As a result, the theory of open super- 
strings can describe D-branes that are stable. Since bosonic string theory D-branes are 
always unstable, this affords new and interesting possibilities. In addition, superstrings 
have supersymmetry. Supersymmetry is a symmetry that relates the bosonic and fermionic 
quantum states of the theory: in a superstring theory we find an equal number of fermionic 
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and bosonic states at every mass level. If supersymmetry exists in nature it must be spon- 
taneously broken: we do not observe degeneracies between fermions and bosons. Many 
physicists believe that supersymmetry is an attractive candidate for the physics that lies 
beyond the presently established Standard Model of particle physics. 



14.2 Anticommuting variables and operators 



We describe the position of classical bosonic strings using the string coordinates X^ir. o). 
The X^ix, g) are classical commuting variables: products of them are independent of the 
order of the factors. In the quantum theory the X 1 ' s become operators that do not generally 
commute. The failure of two operators A and B to commute is encoded in the commutator 
[A, B] = AB — BA. 

To get fermions in string theory we introduce new dynamical world-sheet variables 
i (r, er) and ^ (r, cr). The classical variables (/r„ (a = 1,2) are not ordinary commut- 
ing variables, but rather anticommuting variables. Since this is an important concept let us 
digress. 

Two variables b i and hi are said to anticommute with one another if 

b\b2 — —b2b\. (14.1) 

If b\ and hi are anticommuting variables, more is true. The variables must anticommute 
with themselves. Thus b\b\ — —b\ h \ , which means that 

b\b\ = 0, b 2 b 2 = 0. (14.2) 

For classical anticommuting variables the order of factors is important. If we have a set of 
anticommuting variables bj indexed by i , then we have 

bibj = -bjbi, Vi, j. (14.3) 



Quick calculation 14.1 Verify that the matrices y 1 and y 2 defined by 



Y = 



0 -1 

1 0 



k 2 = 



0 1 
1 0 



anticommute, but are not anticommuting variables. 



(14.4) 



In a quantum theory classical anticommuting variables become quantum operators that 
can sometimes fail to anticommute. Given two such quantum operators f\ and / 2 , the 
failure to anticommute is measured by the anticommutator {f \ , /o}, defined by 

i/i , f 2 \ = f\ f- + hf\ • (14.5) 

If two quantum operators anticommute, their anticommutator is zero. 

We can explain what anticommuting operators have to do with fermions. Recall that 
the quantization of a scalar field, for example, gave us creation and annihilation operators 
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that create particles. The creation operators a' p all commute among themselves and help us 
construct the multiparticle states 

(a f pi ) ni (a^)" 2 . . . (14.6) 

that contain n\ particles with momentum p \ , n 2 particles with momentum p 2 , and so on. 
The rij s are arbitrary positive integers. 

To describe the relativistic electron and its antiparticle, the positron, one uses the clas- 
sical Dirac field, a classical anticommuting field variable. The quantization of the Dirac 
field gives rise to creation and annihilation operators for electrons and different creation 
and annihilation operators for positrons. For convenience, let us focus on electrons. The 
creation operators f ps are labeled by momentum p and spin .v. Electrons are spin one-half 
particles, so the spin label can only take two values. All the electron creation opera- 
tors are anticommuting variables and they all anticommute. In particular, this means that 
fp,sfp,s = 0, for any p and s. A multiparticle state of electrons takes the form 

< R7 > 

and describes a state with an electron with momentum pi and spin ,V| , an electron with 
momentum p 2 and spin S 2 , and so on. Note that we cannot get a state with two electrons that 
have the same momentum and the same spin because fl, s fp,s I £2) =0. The anticommuting 
creation operators automatically implement Fermi’s exclusion principle. In fact, if we use 
a wavefunction to create a superposition of states 

X! f d P P2,s 2 ) fp usi fj, 2 ,s 2 \tt)’ ( 14 - 8 ) 

■S1.S2 

we can quickly conclude that only the part of the wavefunction xj/ that is antisymmetric 
under the simultaneous exchange p\ -o- pi and si -o- s 2 contributes to the above state. 

Quick calculation 14.2 Prove the above claim. 



14.3 World-sheet fermions 



As we mentioned before, classical superstrings require anticommuting dynamical variables 
i/r„ (t, a), with a = 1,2. Recall that for each value of /z the dynamical variable X ,l (t. <j) 
is a world-sheet boson. As it turns out, for each /z, the two components and 1 //^ com- 
prise the variables needed to describe a fermion on the (r, er) world, that is, a world-sheet 
fermion. Remarkably, the quantization of such objects results in particle states that behave 
as spacetime fermions, which is what we need. 

In light-cone quantization X + was set proportional to r and X~ was solved for in terms 
of other quantities. With superstrings this remains true but, in addition, the light-cone 
gauge condition sets ij/+ = 0 and allows one to solve for xj/~ . Both X~ and 1 fr~ receive 
contributions from the transverse X 1 and xj/jj . Since both X 1 ’ and \j/a are spacetime Lorentz 
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vectors, both enter into the definition of the light-cone Lorentz generator M~ l . It fol- 
lows that the commutator [M ~' , M~ J ] receives contributions from both X 1 ' and i /r„ . The 
requirement that this commutator vanishes gives constraints different from those obtained 
for bosonic strings. The number of spacetime dimensions is no longer twenty-six, but rather 
ten. The downward shift of the mass-squared equals minus one-half, rather than minus one. 

In the light-cone gauge we need only concern ourselves with the transverse fields 
V^(r, a). To study their quantization it is convenient to have an action, denoted as S,i , , 
that describes their dynamics. This action is the fermion analog of the action (12.81) that 
describes the dynamics of the transverse coordinates X 1 . So, all in all, the action S that 
describes the full set of degrees of freedom takes the form 

S=£—Jdrj da[x ! X 1 - X 1 ' X 1 '^ + S f , (14.9) 



with 



Sxlr = 



2 1 f dr £ da (3 r + d a )^( + ^ Or - 8*)^ 



(14.10) 



The action S f is the Dirac action for a fermion that lives in the two-dimensional (r, a) 
world. Note that each term in S f contains just one derivative. For bosons, like the X 1 , 
terms in the action contain two derivatives. A term in the Lagrangian that couples a field to 
itself and has just one derivative risks being an irrelevant total derivative. Indeed, consider 
an arbitrary field h and a coupling h(d z h). We have 



h(d T h) = d z (hh) - (3 T h)h, 



(14.11) 



or, equivalently. 



h(d T h) + (d T h)h = 3 T (hh). 



(14.12) 



If the field h is commuting, the two terms on the left-hand side are identical and, indeed, 
h(d z h) is a total derivative. If the field h is anticommuting, (3 z h)h = —h(d z h) and both 
the left-hand side and the right-hand side of the equation vanish. We learn that h(d z h), for 
h anticommuting, is not a total derivative. This means that the action .S',/, is nontrivial only 
because the x[r^ fields are anticommuting. 

Let us vary the fields i j/^ in .S',/, in order to find the equations of motion and the boundary 
conditions. We have 

SS * = £ f dr J dcr [ Or + 8 a )lfr\ l + ikl Or + 3 aWl 

+ 8^ (3 r - 9 a )ir{ + Or - 8a)Wl\. (14-13) 

Consider the second term on the first line of the above right-hand side: 

Vr 7 Or + 3,X Wl 7 = 3r(^ 7 + 3 a W[ 8f() ~ (Or + 3a)^/]«^i 

= 3 T (V f l W[) + dvifl Wl) + Or + d a )xk[- 



(14.14) 
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14.3 World-sheet fermions 



A similar calculation can be done for the second term on the second line of (14.13). Back 
into this equation and, as usual, ignoring the total derivatives along time, 

8S f =^Jd r ^ do^l (3 t + d a )i,[ + 8ir'jd T - 

+ ^ J dx^f[Syjf{ - 



0=71 

o=0 



We can now read immediately the equations of motion 

Or + do)^\ = 0, (3 t - da)^2 — 0’ 

as well as the boundary conditions 

r jr[ (t, cr*) 8\ j/[ (r, a*) - r /^2 (r, cr*) 8i\t[ (r, cr*) = 0, 
that must hold both at cr* = 0 and at cr* = n for all times. 



(14.15) 



(14.16) 



(14.17) 



We will use the above equations of motion and boundary conditions to discuss the quan- 
tization and the state space of the theory. Our analysis will not be complete. In particular, 
we will not discuss the anticommutation relations satisfied by the fields i j/ l a and, as a result, 
we will not justify the anticommutation relations of the corresponding oscillators. Never- 
theless, the discussion will illuminate the main features of the quantization, and all results 
should seem plausible. 

We begin by noting that the equations of motion (14.16) imply that i is right-moving 
and r/r, is left-moving: 

^/(r , cr) = 'kf (r -a), 

f I 2 (z,cj) = y I 2 (j +o). (14.18) 

Let us now consider the boundary conditions (14.17). What can we do to satisfy these 
conditions? A little thought shows that attempts to make each term in (14.17) vanish do 
not work. Try, for example, setting i// | / (r, 0) = 0. Our solution in (14.18) then implies 
*kf (r) = 0, for all r, which results in i (r, cr) = 0. 

The situation improves if we impose boundary conditions that relate ^ / and i/cf at the 
endpoints. For each cr* we take 

lAi ( r , cr*) = il/rj (r, cr*), (14.19) 

where the choice of sign is still to be determined. If the fields are so constrained at an 
endpoint, their variations must also respect this condition: 

(r, cr*) = ± 8^2 (t, cr*). (14.20) 

It then follows by multiplication of the last two equations that (14.17) holds for either 
choice of sign. 

Let us now discuss the signs. Since both i/r^ and i/r| appear quadratically in the 
action, their signs can be changed without physical consequence. This arbitrariness is used 
conventionally to demand that at the endpoint cr* = 0, 

l/r/Cr, 0) = 



(14.21) 
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Since we cannot change the sign of i/r 7 or r /r 7 without changing this condition, the choice 
of sign at the other endpoint is physically relevant: 

i/r 7 (T, tt) = ir/4 (r, 7r). (14.22) 



The full superstring theory state space breaks into two subspaces or, as they are typically 
called, two sectors : a Ramond (R) sector which contains the states that arise using the top 
choice of sign, and a Neveu-Schwarz (NS) sector which contains the states that arise using 
the lower choice of sign. The boundary conditions are better understood by assembling a 
fermion field T 7 defined over the full interval o e [— 7r, tt] : 



4 // (r, cr ) 



i/r 7 (r, cr) a e [0, tt ] 

1^2 (T, — <7 ) O e [— 7T, 0]. 



(14.23) 



This construction is reminiscent of (12.36), which defined a field over a e [—tt, 7r] for 
bosonic open strings. The boundary condition (14.21) guarantees that *P 7 is continuous at 
er = 0. Moreover, on account of (14.18), 4 /7 (r, cr) is a function of r — cr: 



4 /7 (r, cr) = xV - cr). 



(14.24) 



Finally, the boundary condition (14.22) gives 

VP 7 (t, tt ) = i jr[ (r, tt) — ii/r^ (r, it) = ±4> 7 (r, —tt). (14.25) 

We thus learn that a periodic fermion 'T 7 corresponds to Ramond boundary conditions and 
an antiperiodic fermion *P 7 corresponds to Neveu-Schwarz boundary conditions: 

4 /7 (r, tt) = +'k 7 (r, — tt) Ramond boundary condition, 

, , (14.26) 

rp (r, tt) = — 'P (r, —tt) Neveu-Schwarz boundary condition. 

Let us consider both cases in detail now. 



14.4 Neveu-Schwarz sector 



Since the Neveu-Schwarz fermion 'P 7 is a function of r — cr and changes sign when cr — > 
a + 2tt, it must be expanded with fractionally moded exponentials: 

^(t, cr) ~ K e ~ ir(z ~ a) , (14.27) 

reZ+l/2 



up to a normalization factor that will not enter our discussion. Indeed, for any r = n + \, 
with n integer, we have 



Jr(cr+2n) _ ira i(n+\)2n _ ira in _ 



(14.28) 



which guarantees that T 7 is antiperiodic. Since 'P 7 is anticommuting, the expansion coef- 
ficients bj. are anticommuting operators. Following our usual notation, the negatively 
moded coefficients b l _ X p, b^j 2 , b l _ 5 ^ 2 , • • ■, are creation operators, while the positively 
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14.4 Neveu-Schwarz sector 



moded ones b\, 2 , bLj, ^ 5 / 2 » ■ ■ • are annihilation operators. These operators act on a vac- 
uum, henceforth called the Neveu-Schwarz vacuum |NS>. These operators satisfy the 
anticommutation relation 



[b I r ,b J s } = 8 r+sfi 8 IJ . 



(14.29) 



Given that all creation operators anticommute with one another and square to zero, each 
h ] _ r can appear at most once in any state. Since the X 1 (t, a) are quantized as usual, we 
still have the a ! _ n creation operators. Thus, the states in the NS sector are of the form: 



NS sector: |A> = [~[ 0 lp+ ’ Pt) ■ (14 ' 30) 



7=2 n=l 



7=2 



Here the p r j are either zero or one. We have written the full ground state as a “product” 
<g> of the ground state |NS) for the b J _ r operators and the ground state \p + , pr) for the a l _ n 
operators. The order in which the b operators appear in the state does not matter when we 
consider a single state. Since all the bs anticommute, different orderings can only differ by 
an overall sign, and no new states are obtained. 

The mass-squared operator in the NS sector, before normal ordering, is given by 



M- = - 



+ ^ E rb -rti). 



(14.31) 



P + 0 



reZ+t 



We can use the heuristic method based on £ -functions to find the ordering constant “a” 
in M 2 - the constant that must be added inside the above parenthesis when the sums are 
replaced by their normal-ordered versions. For the bosonic a 1 oscillators we know that 
each coordinate contributes — ^ to a. We record this piece of information as 



as = 



1 

"24‘ 



(14.32) 



For the NS fermions the terms in M 2 that require reordering are 

l E rb -rK = \ 



^b-rbi-^D- 2)(i 






(14.33) 



In the first step we let r — > — r and in the second step we used the anticommutator (14.29). 
The sum over odd positive integers was evaluated in (13.116) and gives +j^. 
follows that 



It then 



\ E rb -r b 'r = \ J2 rb -r b r ~ ^ D 



2 ). 



(14.34) 



,__i _3 

r— 2 - 2- 



We have thus learned that each NS fermion (antiperiodic fermion) contributes to a the 
constant ans given by 



(14.35) 
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The full ordering constant for M 2 is therefore 

a = {D- 2 ){a B + «ns) = (D - 2)(-^ - = -(D - 2)^. (14.36) 

For D = 10 we find a = — ^ and therefore (14.31) gives 

1 CO 

M 2 = — + N ± ), with N 2 - = J2 a -p 0lI p+ ^rb-rbj-. (14.37) 

“ p=\ .-it 

r r— 2 , 2 ... 

N 1 - is a number operator that counts the contributions from the a 1 s and the b 1 s. 

Quick calculation 14.3 To test that N 2 - includes the fermionic contribution to the number, 
show that the eigenvalue of N 2 - on bL n bL r2 |NS), with r \ , r 2 > 0, is r\ + r 2 . 

We can now list the first few levels of states in the NS sector. Organized by the number 
eigenvalue or, equivalently, mass-squared, we have 

ol'M 2 = -|, N 2 - = 0 : |NS) <g> \p + , p T ), 

ol'M 2 = 0, N 2 - = \ : ^ 1/2 |NS) ® \p + , p T ), 

a' M 2 = l N 1 - = 1 : {«i l5 ^ 1/2 ^ 1/2 }|NS> ® | p+, p T ), 

a'M 2 =l,N x = \: {a'^bi 1/2 , bL 3/2 , bL l/2 b J _ l/2 bE l/2 }|NS> ® | p+, p T ). (14.38) 

The states with N 2 - — 0 have a'M 2 = — 1. The states with N 2 - = 2 are massless. There 
are eight of them, labeled by the vector index I . 

Quick calculation 14.4 How many states are there at N 2 - = 

It is useful to have an operator whose value on states is + 1 if the state is bosonic and — 1 
if the state is fermionic. This operator is usually called (— 1 ) F , where F stands for fermion 
number. This is reasonable, states with even fermion number are bosonic and states with 
odd fermion number are fermionic. To calculate (— l) 7 ' on any state we must first give the 
eigenvalue of ( — 1 ) on the Neve u-Schwarz ground states |NS) ® | p + , pr). Let us declare 
that number to be minus one, thus making the ground states fermionic: 

(-1) F |NS) ® | p + , p T ) = ~ |NS> 0 | p + , p T >■ (14.39) 

The eigenvalue of (— l) f on a state is equal to minus one times a sequence of factors of 
minus one, one for each fermionic oscillator that appears in the state. Thus, acting on the 
generic state (14.30) we get 

(-1) F \X) = (14.40) 

Operationally, this result follows if we take (— 1) F to anticommute with all of the fermionic 
operators 



{(-l) F ,b I r }=0. 



(14.41) 
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14.5 Ramond sector 



Consider again our list of states (14.38). Since fermionic oscillators contribute half- 
integers to N ■*“, states with integer /V ' must have an even number of fermionic oscillators. 
So all states with integer N L have (— l) f = —1; they are fermionic states. All states with 
half-integer N 1 - have an odd number of fermionic oscillators and therefore (— 1 ) F = +1. 
These are bosonic states, and they include the eight massless states on the second line of 
the table. 

The fermion or boson character of the states we have discussed so far is restricted to 
the (r, er) world-sheet, where the T fields are fermions. We will later discuss if the above 
states are fermions or bosons in spacetime. 



14.5 Ramond sector 



With Ramond boundary conditions (14.26) the field >h / is periodic and can be expanded 
using integrally moded oscillators: 

'T / (t ,o)~Y^ d n e ~ in(x ~' J) - (14.42) 

Since T 1 is anticommuting, the oscillators d l n are anticommuting operators. Again, the 
negatively moded oscillators d 1 ] , d'_ 2 , d ! _ 3 , . . ., are creation operators, while the positively 
moded ones d{, d!, , . . . are annihilation operators. The Ramond oscillators satisfy the 

anticommutation relation 

{d I m ,d J „] = & m+n , oS ,J - (14.43) 

As in the NS sector, the Ramond creation operators are all anticommuting and, as a result, 
can appear at most only once on any given state. 

Ramond fermions are more complicated than NS fermions because the eight fermionic 
zero modes d^ must be treated with care. It turns out that these eight operators can be 
organized by simple linear combinations into four creation operators and four annihilation 
operators. Let us call the four creation operators 



$1, $2, $3, $4- (14.44) 

Being zero modes, these creation operators do not contribute to the mass-squared of the 
states. Postulating a unique vacuum |0), the creation operators allow us to construct 16 = 
2 4 degenerate Ramond ground states. In fact, eight of these states have an even number of 
£ s acting on |0) and the other eight have an odd number of £ s acting on |0) . Explicitly, the 
eight states \R rl ), a = 1, 2, . . ., 8, with an even number of creation operators are 



states | R a ) : 



| 0 >, 

£t&|0), £i&|0), £i£ 4 |0>, fe?3|0>, &&I0), I3&I0), 
$1&$3$4|0). 



(14.45) 
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The eight states |/?«}, a = 1, 2, . . 8, with an odd number of creation operators are 



states \Rg) 



£i|0), &|0>, | 3 |0), ^4 1 0} , 

$tfc&|0), £l&&|0>, Il?3?4|0), & 3 &\ 0 ). 



(14.46) 



The states | R a } and | R„ ) comprise the full set of degenerate Ramond ground states, denoted 
as |Ra) with A — 1, . . 16. The Ramond sector of the state space contains the states 

9 oo 9 oo 

R sector: \X) = fl 11 Yl ^ d -m) PmJ I r a> ® \p + , Pt)- (14.47) 

7=2 n=l 7=2 m= 1 

Here the p m j are either zero or one. 

Just like in the NS sector, the Ramond sector has an (— l) f operator. This operator 
anticommutes with all the fermion oscillators, including the zero modes: 

{(-1) F , 4} =0, (14.48) 



and, additionally, we conventionally declare |0) to be fermionic 

(-l) f |0) = — 10>. (14.49) 



It thus follows that all eight |R fl ) states are fermionic and all | R„) states are bosonic. 

The mass-squared operator in the R sector, before normal ordering, is given by 

Ml = ^ {\ E «-7 K + 2 E nd -n <)■ ( 14 -50) 

/?/=0 7 igZ 

For the R fermions the terms that require ordering are 

\ E nd -n d h = “2 E nd n d -n 

n=- 1,-2,... ji=1,2,... 

= 2 J2 nd -n d n - U D ~ 2)(l + 2 + 3 + --.) 

n=l,2,... 

= 2 T. nd -n d n + YA {D ~ 2y (14 ’ 51) 

n=l,2,... 

We thus learn that the ordering contribution from a (periodic) Ramond fermion is 

cir = (14.52) 

This number is precisely the opposite of as- Since there are equal numbers of bosonic 
coordinates X 1 and Ramond fermions, their respective normal-ordering contributions to 
the mass-squared cancel out and (14.50) gives: 

M 2 = i J2( a -n a h + nd -n d 'n)- (14-53) 

n> 1 

This formula implies that all the Ramond ground states are massless. 



